Skip to content

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #134

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1

Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #134

codespell (3.12)

succeeded Nov 8, 2024 in 10s