Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #27379
