Pull requests: vllm-project/vllm
Pull requests list
[Misc] Use registry-based initialization for KV cache transfer connector.
#11481
opened Dec 25, 2024 by
KuntaiDu
[Misc] Allow initializing KV cache transfer agent when using third-party library for disaggregated prefill
#11480
opened Dec 25, 2024 by
KuntaiDu
Update amd-installation.md
ci/build
documentation
Improvements or additions to documentation
#11470
opened Dec 24, 2024 by
johnnynunez
Update deploying_with_k8s.md with AMD ROCm GPU example
documentation
Improvements or additions to documentation
#11465
opened Dec 24, 2024 by
AlexHe99
[Misc] Update disaggregation benchmark scripts and test logs
ready
ONLY add when PR is ready to merge/full CI is needed
#11456
opened Dec 24, 2024 by
Jeffwan
[Frontend] improve hermes_tool_parser.py
ci/build
frontend
#11453
opened Dec 24, 2024 by
paulcx
[Platform] More consistent entrypoints across different platforms
ci/build
#11448
opened Dec 24, 2024 by
terrytangyuan
[Model][LoRA]LoRA support added for MolmoForCausalLM
ci/build
documentation
Improvements or additions to documentation
frontend
needs-rebase
#11439
opened Dec 23, 2024 by
ayylemao
fix: add missing bos_token to example templates
ci/build
#11432
opened Dec 23, 2024 by
toslunar
Bump helm/kind-action from 1.10.0 to 1.11.0
ci/build
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#11424
opened Dec 23, 2024 by
dependabot
bot
[WIP][Doc]Add documentation for using EAGLE in vLLM
documentation
Improvements or additions to documentation
[V1] Optimize block table transfer from CPU to GPU
ci/build
#11401
opened Dec 22, 2024 by
WoosukKwon
•
Draft
[VLM] Support caching in merged multi-modal processor
ci/build
documentation
Improvements or additions to documentation
#11396
opened Dec 21, 2024 by
DarkLight1337
[V1] Use FlashInfer Sampling Kernel for Top-P & Top-K Sampling
#11394
opened Dec 21, 2024 by
WoosukKwon
•
Draft