Anonymous /g/105766280#105768194
7/1/2025, 7:45:06 PM
sageattn_qk_int8_pv_fp8_cuda: INT8 quantization for QK and FP8 for PV using CUDA backend. (Note that setting pv_accum_dtype=fp32+fp16 corresponds to SageAttention2++.)
so how are we supposed to use sageattn2++?
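
Going by the quoted note, one plausible reading is that you call that kernel directly and pass the accumulator setting yourself. A rough sketch, not a confirmed recipe: only the function name and the "fp32+fp16" value come from the quote. The `sageattention` import path and the `tensor_layout` / `is_causal` keywords are assumptions about the package, and `sage2pp_kwargs` / `sage2pp_attention` are made-up helper names.

```python
# Only the kernel name and the "fp32+fp16" value come from the quoted text.
# The import path and the other keyword names are assumptions; check them
# against the version of the package you actually have installed.
SAGE2PP_ACCUM = "fp32+fp16"


def sage2pp_kwargs(is_causal=False, tensor_layout="HND"):
    """Build the keyword arguments for the call (hypothetical helper)."""
    return {
        "tensor_layout": tensor_layout,   # assumed keyword
        "is_causal": is_causal,           # assumed keyword
        "pv_accum_dtype": SAGE2PP_ACCUM,  # value named in the quoted note
    }


def sage2pp_attention(q, k, v, **options):
    """Call the INT8/FP8 CUDA kernel with the SageAttention2++ setting."""
    # Imported lazily so the helper above works without the package or a GPU.
    from sageattention import sageattn_qk_int8_pv_fp8_cuda  # assumed import path

    return sageattn_qk_int8_pv_fp8_cuda(q, k, v, **sage2pp_kwargs(**options))
```

If that holds, `sage2pp_attention(q, k, v, is_causal=True)` with q/k/v already on the GPU would be the whole thing. If you're going through some wrapper instead of calling the kernel yourself, the wrapper would have to pass pv_accum_dtype through for you.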