Skip to content

decoder MMHA kernel support INT8 SCALE_Q_INSTEAD_OF_K and SCALE_P_INS…#2085

Open
lishicheng1996 wants to merge 1 commit intoNVIDIA:mainfrom lishicheng1996:decoder_attn_int8_kvcache_dequant_qp