Actions: vllm-project/vllm

Lint and Deploy Charts

898 workflow run results

[V1] PR 1/N for v1 sample and prompt logprobs support
Lint and Deploy Charts #823: Pull request #9880 synchronize by afeldman-nm
December 18, 2024 07:22 8m 1s neuralmagic:afeldman-nm/v1_logprobs
[V1] TP Ray executor
Lint and Deploy Charts #821: Pull request #11107 synchronize by ruisearch42
December 18, 2024 06:42 7m 14s ruisearch42:v1_tp_raycg
[CI]add genai-perf benchmark in nightly benchmark
Lint and Deploy Charts #815: Pull request #10704 synchronize by jikunshang
December 18, 2024 05:28 7m 21s jikunshang:add_genai_perf
[Model] Whisper model implementation
Lint and Deploy Charts #814: Pull request #11280 opened by aurickq
December 18, 2024 03:46 7m 49s Snowflake-Labs:whisper
[Neuron][Kernel] NKI-based flash-attention kernel with paged KV cache
Lint and Deploy Charts #813: Pull request #11277 synchronize by liangfu
December 18, 2024 03:45 7m 32s liangfu:nki-flash-attn
[Neuron][Kernel] NKI-based flash-attention kernel with paged KV cache
Lint and Deploy Charts #812: Pull request #11277 synchronize by liangfu
December 18, 2024 03:39 8m 7s liangfu:nki-flash-attn
[Model] Add RWKV6 linear attention model
Lint and Deploy Charts #811: Pull request #11193 synchronize by harrisonvanderbyl
December 18, 2024 03:15 7m 4s harrisonvanderbyl:rwkv6
[MISC][XPU]update ipex link for CI fix
Lint and Deploy Charts #810: Pull request #11278 opened by yma11
December 18, 2024 03:02 7m 44s yma11:link-fix
LoRA Support for Ultravox model
Lint and Deploy Charts #809: Pull request #11253 synchronize by thedebugger
December 18, 2024 02:17 7m 41s thedebugger:svij-ultravox-lora-dec-16
[Neuron][Kernel] NKI-based flash-attention kernel with paged KV cache
Lint and Deploy Charts #808: Pull request #11277 opened by liangfu
December 18, 2024 02:00 7m 14s liangfu:nki-flash-attn
[Misc] Optimize ray worker initialization time
Lint and Deploy Charts #807: Pull request #11275 synchronize by ruisearch42
December 18, 2024 01:54 7m 37s ruisearch42:opt_ray_worker_init
[Kernel][LoRA]Punica prefill kernels fusion
Lint and Deploy Charts #806: Pull request #11234 synchronize by jeejeelee
December 18, 2024 01:49 7m 5s jeejeelee:punica-kernel-fusion
[Misc] Optimize ray worker initialization time
Lint and Deploy Charts #805: Pull request #11275 synchronize by ruisearch42
December 18, 2024 01:43 7m 18s ruisearch42:opt_ray_worker_init
[Misc] Optimize ray worker initialization time
Lint and Deploy Charts #804: Pull request #11275 opened by ruisearch42
December 18, 2024 00:59 7m 14s ruisearch42:opt_ray_worker_init
[CI][Misc] Remove Github Action Release Workflow
Lint and Deploy Charts #803: Pull request #11274 opened by simon-mo
December 18, 2024 00:58 7m 30s remove-gha-release-workflow
[CI] Adding CPU docker pipeline
Lint and Deploy Charts #802: Pull request #11261 synchronize by zhouyuan
December 18, 2024 00:51 8m 13s zhouyuan:wip_cpu_docker_pipeline
[Not for review] Basic benchmarking Ray initialization
Lint and Deploy Charts #801: Pull request #11272 synchronize by ruisearch42
December 18, 2024 00:11 8m 1s ruisearch42:i10283
[Not for review] Basic benchmarking Ray initialization
Lint and Deploy Charts #799: Pull request #11272 opened by ruisearch42
December 17, 2024 23:04 7m 45s ruisearch42:i10283