Skip to content

WIP: [GPU] Use FP32 accumulator for QK multiplication for 2nd+ token calculation in PagedAttention#28673

Open
sshlyapn wants to merge 1 commit intoopenvinotoolkit:masterfrom sshlyapn:paged_attention_2nd_token_fp32_acc