[Feature] [Spec decode]: Enable MLPSpeculator/Medusa and prompt_logprobs
with ChunkedPrefill
#3733
Job | Run time |
---|---|
7m 6s | |
7m 6s |
prompt_logprobs
with ChunkedPrefill
#3733
Job | Run time |
---|---|
7m 6s | |
7m 6s |