Skip to content

Actions: vllm-project/vllm

Lint and Deploy Charts

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,840 workflow runs
3,840 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #3816: Pull request #9880 synchronize by afeldman-nm
January 22, 2025 05:23 6m 59s neuralmagic:afeldman-nm/v1_logprobs
January 22, 2025 05:23 6m 59s
[Core] Support fully transparent sleep mode
Lint and Deploy Charts #3815: Pull request #11743 synchronize by youkaichao
January 22, 2025 05:22 7m 8s youkaichao:cumem
January 22, 2025 05:22 7m 8s
[Core] Make disaggregated prefill compatible with pipeline parallelism
Lint and Deploy Charts #3814: Pull request #12301 opened by YuhanLiu11
January 22, 2025 05:10 7m 5s YuhanLiu11:main
January 22, 2025 05:10 7m 5s
[Kernel] Flash Attention 3 Support
Lint and Deploy Charts #3813: Pull request #12093 synchronize by LucasWilkinson
January 22, 2025 04:46 7m 43s neuralmagic:lwilkinson/fa3
January 22, 2025 04:46 7m 43s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3812: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:30 7m 31s youkaichao:builder
January 22, 2025 04:30 7m 31s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3811: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:28 6m 47s youkaichao:builder
January 22, 2025 04:28 6m 47s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3810: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:23 6m 46s youkaichao:builder
January 22, 2025 04:23 6m 46s
[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #3809: Pull request #9880 synchronize by afeldman-nm
January 22, 2025 04:21 7m 7s neuralmagic:afeldman-nm/v1_logprobs
January 22, 2025 04:21 7m 7s
[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #3808: Pull request #9880 synchronize by afeldman-nm
January 22, 2025 04:21 7m 42s neuralmagic:afeldman-nm/v1_logprobs
January 22, 2025 04:21 7m 42s
[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #3807: Pull request #9880 synchronize by afeldman-nm
January 22, 2025 04:14 7m 0s neuralmagic:afeldman-nm/v1_logprobs
January 22, 2025 04:14 7m 0s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3806: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:10 6m 58s youkaichao:builder
January 22, 2025 04:10 6m 58s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3805: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:05 7m 41s youkaichao:builder
January 22, 2025 04:05 7m 41s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3804: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:01 7m 5s youkaichao:builder
January 22, 2025 04:01 7m 5s
[Build] update requirements of no-device
Lint and Deploy Charts #3803: Pull request #12299 opened by MengqingCao
January 22, 2025 03:54 7m 0s MengqingCao:fix
January 22, 2025 03:54 7m 0s
[V1][Frontend] Coalesce bunched RequestOutputs
Lint and Deploy Charts #3802: Pull request #12298 opened by njhill
January 22, 2025 03:51 7m 12s njhill:coalesce-stream
January 22, 2025 03:51 7m 12s
[Ci/Build] Fix mypy errors on main
Lint and Deploy Charts #3801: Pull request #12296 opened by DarkLight1337
January 22, 2025 03:32 7m 38s DarkLight1337:fix-pre-commit
January 22, 2025 03:32 7m 38s
[CI/lint] Fix pre-commit
Lint and Deploy Charts #3800: Pull request #12295 opened by khluu
January 22, 2025 03:24 7m 7s khluu/fix_precommit2
January 22, 2025 03:24 7m 7s
[V1][Spec Decode] Ngram Spec Decode
Lint and Deploy Charts #3799: Pull request #12193 synchronize by LiuXiaoxuanPKU
January 22, 2025 02:50 6m 52s LiuXiaoxuanPKU:ngram
January 22, 2025 02:50 6m 52s
[Kernel] Pipe attn_logits_soft_cap through paged attention TPU kernels
Lint and Deploy Charts #3798: Pull request #12294 opened by fenghuizhang
January 22, 2025 02:07 7m 35s fenghuizhang:main
January 22, 2025 02:07 7m 35s
[Docs] Update FP8 KV Cache documentation
Lint and Deploy Charts #3797: Pull request #12238 synchronize by mgoin
January 22, 2025 02:00 8m 0s neuralmagic:updated-kv-cache-quant-docs
January 22, 2025 02:00 8m 0s
[Frontend][V1] Online serving performance improvements
Lint and Deploy Charts #3796: Pull request #12287 synchronize by njhill
January 22, 2025 01:39 7m 28s njhill:v1-perf-smoothing
January 22, 2025 01:39 7m 28s
[CI] add docker volume prune to neuron CI
Lint and Deploy Charts #3795: Pull request #12291 synchronize by liangfu
January 22, 2025 01:14 7m 5s liangfu:fix-volume-prune
January 22, 2025 01:14 7m 5s
[CI] add docker volume prune to neuron CI
Lint and Deploy Charts #3794: Pull request #12291 opened by liangfu
January 22, 2025 01:11 7m 38s liangfu:fix-volume-prune
January 22, 2025 01:11 7m 38s
[Benchmark] More accurate TPOT calc in benchmark_serving.py
Lint and Deploy Charts #3792: Pull request #12288 opened by njhill
January 22, 2025 00:17 7m 8s njhill:bench-tpot-tokens
January 22, 2025 00:17 7m 8s