Disable spec-decode + chunked-prefill for draft models with tensor parallelism > 1 #681

add-label-on-auto-merge

succeeded Nov 8, 2024 in 5s