Skip to content

Issues: NVIDIA/TensorRT-LLM

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

support for T4 triaged Issue has been triaged by maintainers
#2620 opened Dec 24, 2024 by krishnanpooja
4 tasks
SIGABRT while trying to build trtllm engine for biomistral model on T4 triaged Issue has been triaged by maintainers
#2619 opened Dec 24, 2024 by krishnanpooja
2 of 4 tasks
[Performance] What is the purpose of compiling a model? triaged Issue has been triaged by maintainers
#2617 opened Dec 24, 2024 by Flynn-Zh
4 tasks
Phi4 support? triaged Issue has been triaged by maintainers
#2616 opened Dec 24, 2024 by oscarbg
Performance of streaming requests is worse than non-streaming bug Something isn't working Investigating Performance Issue about performance number triaged Issue has been triaged by maintainers
#2613 opened Dec 24, 2024 by activezhao
2 of 4 tasks
How to suppress the WARNING logging?
#2610 opened Dec 24, 2024 by lxp3
Adding custom sampling config triaged Issue has been triaged by maintainers
#2609 opened Dec 23, 2024 by buddhapuneeth
1 of 4 tasks
Gemma 2 LoRA support Investigating Lora/P-tuning triaged Issue has been triaged by maintainers
#2606 opened Dec 21, 2024 by Aquasar11
[Feature Request] Better support for w4a8 quantization Investigating Low Precision Issue about lower bit quantization, including int8, int4, fp8 triaged Issue has been triaged by maintainers
#2605 opened Dec 20, 2024 by ShuaiShao93
SmoothQuant doesn't work with lora bug Something isn't working Investigating Lora/P-tuning triaged Issue has been triaged by maintainers
#2604 opened Dec 20, 2024 by ShuaiShao93
4 tasks
lora doesn't work with --use_fp8_rowwise bug Something isn't working
#2603 opened Dec 20, 2024 by ShuaiShao93
4 tasks
--use_fp8 doesn't work with llama 3.1 8b bug Something isn't working
#2602 opened Dec 20, 2024 by ShuaiShao93
4 tasks
No module named 'tensorrt_llm.bindings' bug Something isn't working
#2599 opened Dec 20, 2024 by WGS-note
2 of 4 tasks
[Performance] TTFT of qwen2.5 0.5B model bug Something isn't working
#2598 opened Dec 20, 2024 by ReginaZh
4 tasks
Unable to install TensorRT-LLM
#2597 opened Dec 20, 2024 by gowthamtupili
internVL with batch_size>1
#2591 opened Dec 19, 2024 by nzarif
ProTip! Type g i on any issue or pull request to go back to the issue listing page.