Skip to content

Actions: vllm-project/vllm

Lint and Deploy Charts

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,811 workflow runs
3,811 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3812: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:30 7m 31s youkaichao:builder
January 22, 2025 04:30 7m 31s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3811: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:28 6m 47s youkaichao:builder
January 22, 2025 04:28 6m 47s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3810: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:23 6m 46s youkaichao:builder
January 22, 2025 04:23 6m 46s
[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #3809: Pull request #9880 synchronize by afeldman-nm
January 22, 2025 04:21 7m 7s neuralmagic:afeldman-nm/v1_logprobs
January 22, 2025 04:21 7m 7s
[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #3808: Pull request #9880 synchronize by afeldman-nm
January 22, 2025 04:21 7m 42s neuralmagic:afeldman-nm/v1_logprobs
January 22, 2025 04:21 7m 42s
[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #3807: Pull request #9880 synchronize by afeldman-nm
January 22, 2025 04:14 7m 0s neuralmagic:afeldman-nm/v1_logprobs
January 22, 2025 04:14 7m 0s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3806: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:10 6m 58s youkaichao:builder
January 22, 2025 04:10 6m 58s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3805: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:05 7m 41s youkaichao:builder
January 22, 2025 04:05 7m 41s
[core] separate builder init and builder prepare for each batch
Lint and Deploy Charts #3804: Pull request #12253 synchronize by youkaichao
January 22, 2025 04:01 7m 5s youkaichao:builder
January 22, 2025 04:01 7m 5s
[Build] update requirements of no-device
Lint and Deploy Charts #3803: Pull request #12299 opened by MengqingCao
January 22, 2025 03:54 7m 0s MengqingCao:fix
January 22, 2025 03:54 7m 0s
[V1][Frontend] Coalesce bunched RequestOutputs
Lint and Deploy Charts #3802: Pull request #12298 opened by njhill
January 22, 2025 03:51 7m 12s njhill:coalesce-stream
January 22, 2025 03:51 7m 12s
[Ci/Build] Fix mypy errors on main
Lint and Deploy Charts #3801: Pull request #12296 opened by DarkLight1337
January 22, 2025 03:32 7m 38s DarkLight1337:fix-pre-commit
January 22, 2025 03:32 7m 38s
[CI/lint] Fix pre-commit
Lint and Deploy Charts #3800: Pull request #12295 opened by khluu
January 22, 2025 03:24 7m 7s khluu/fix_precommit2
January 22, 2025 03:24 7m 7s
[V1][Spec Decode] Ngram Spec Decode
Lint and Deploy Charts #3799: Pull request #12193 synchronize by LiuXiaoxuanPKU
January 22, 2025 02:50 6m 52s LiuXiaoxuanPKU:ngram
January 22, 2025 02:50 6m 52s
[Kernel] Pipe attn_logits_soft_cap through paged attention TPU kernels
Lint and Deploy Charts #3798: Pull request #12294 opened by fenghuizhang
January 22, 2025 02:07 7m 35s fenghuizhang:main
January 22, 2025 02:07 7m 35s
[Docs] Update FP8 KV Cache documentation
Lint and Deploy Charts #3797: Pull request #12238 synchronize by mgoin
January 22, 2025 02:00 8m 0s neuralmagic:updated-kv-cache-quant-docs
January 22, 2025 02:00 8m 0s
[Frontend][V1] Online serving performance improvements
Lint and Deploy Charts #3796: Pull request #12287 synchronize by njhill
January 22, 2025 01:39 7m 28s njhill:v1-perf-smoothing
January 22, 2025 01:39 7m 28s
[CI] add docker volume prune to neuron CI
Lint and Deploy Charts #3795: Pull request #12291 synchronize by liangfu
January 22, 2025 01:14 7m 5s liangfu:fix-volume-prune
January 22, 2025 01:14 7m 5s
[CI] add docker volume prune to neuron CI
Lint and Deploy Charts #3794: Pull request #12291 opened by liangfu
January 22, 2025 01:11 7m 38s liangfu:fix-volume-prune
January 22, 2025 01:11 7m 38s
[Benchmark] More accurate TPOT calc in benchmark_serving.py
Lint and Deploy Charts #3792: Pull request #12288 opened by njhill
January 22, 2025 00:17 7m 8s njhill:bench-tpot-tokens
January 22, 2025 00:17 7m 8s
[Frontend][V1] Online serving performance improvements
Lint and Deploy Charts #3791: Pull request #12287 synchronize by njhill
January 21, 2025 23:40 7m 21s njhill:v1-perf-smoothing
January 21, 2025 23:40 7m 21s
[Frontend][V1] Online serving performance improvements
Lint and Deploy Charts #3790: Pull request #12287 opened by njhill
January 21, 2025 23:38 7m 2s njhill:v1-perf-smoothing
January 21, 2025 23:38 7m 2s
[Core] Reduce TTFT with concurrent partial prefills
Lint and Deploy Charts #3789: Pull request #10235 synchronize by joerunde
January 21, 2025 23:35 7m 36s opendatahub-io:prefill-slots
January 21, 2025 23:35 7m 36s
[FEATURE] Enables offline /score for embedding models
Lint and Deploy Charts #3788: Pull request #12021 synchronize by gmarinho2
January 21, 2025 23:22 7m 51s gmarinho2:main
January 21, 2025 23:22 7m 51s