forked from vllm-project/vllm
-
Notifications
You must be signed in to change notification settings - Fork 52
Pull requests: HabanaAI/vllm-fork
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add FP8 TP=2 scenario to Jenkins CI
enhancement
New feature or request
habana
Issues or PRs submitted by Habana Labs
#478
opened Nov 8, 2024 by
afierka-intel
•
Draft
[DO NOT MERGE] Upstream codebase diff
habana
Issues or PRs submitted by Habana Labs
#470
opened Nov 6, 2024 by
kzawora-intel
•
Draft
Add models-tiny CI step with Llama3.2-1B
habana
Issues or PRs submitted by Habana Labs
#440
opened Oct 28, 2024 by
kzawora-intel
•
Draft
Resolved alibi bias issue due to porting flat PA pr
#437
opened Oct 28, 2024 by
tannervoas742
Loading…
Add HPU information to collect_env script
habana
Issues or PRs submitted by Habana Labs
#430
opened Oct 25, 2024 by
michalkuligowski
Loading…
[PoC] Add max padding ratio to padding aware scheduler
habana
Issues or PRs submitted by Habana Labs
#407
opened Oct 18, 2024 by
kzawora-intel
•
Draft
Create run-lm-eval-mmlu.sh
habana
Issues or PRs submitted by Habana Labs
#399
opened Oct 16, 2024 by
michalkuligowski
•
Draft
WA for OOM in qwen 2 - sync after loading weights
habana
Issues or PRs submitted by Habana Labs
#398
opened Oct 16, 2024 by
michalkuligowski
Loading…
[bucketing overhaul 2/n] Delegate bucket management to HPUBucketingContext
habana
Issues or PRs submitted by Habana Labs
#395
opened Oct 15, 2024 by
kzawora-intel
Loading…
Add bucket calibration, allow reading/writing bucketing configs to file
habana
Issues or PRs submitted by Habana Labs
#345
opened Sep 27, 2024 by
kzawora-intel
Loading…
Optimize LoRA mask creation
habana
Issues or PRs submitted by Habana Labs
#285
opened Sep 13, 2024 by
SanjuCSudhakaran
•
Draft
[build] Changes for RH build
external
Issues or PRs submitted by external users
#190
opened Aug 15, 2024 by
Xaenalt
Loading…
ProTip!
Follow long discussions with comments:>50.