WIP: [GPU] Use FP32 accumulator for QK multiplication for 2nd+ token calculation in PagedAttention #18555

Annotations

1 warning

Pytorch Layer Tests  /  PyTorch Layer Tests

succeeded Jan 24, 2025 in 3m 13s