Skip to content

Pull requests: HabanaAI/vllm-hpu-extension

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add AWQ class
#29 opened Nov 8, 2024 by maktukmak Loading…
Add GPTQ class
#28 opened Nov 8, 2024 by maktukmak Loading…
do not use softmax fast mode in FusedSDPA
#26 opened Nov 5, 2024 by ccrhx4 Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.