Skip to content

Issues: triton-inference-server/tensorrtllm_backend

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

problem with streaming bug Something isn't working
#640 opened Nov 9, 2024 by Alireza3242
2 of 4 tasks
Support non-detached mode for python trtllm backend bug Something isn't working
#639 opened Nov 6, 2024 by ShuaiShao93
4 tasks
the output of bls is unstable bug Something isn't working
#630 opened Oct 23, 2024 by dwq370
4 tasks
Streaming Inference Failure bug Something isn't working
#626 opened Oct 20, 2024 by imilli
2 of 4 tasks
The GPU memory usage is too high. bug Something isn't working
#625 opened Oct 19, 2024 by imilli
2 of 4 tasks
Garbage response when input tokens is longer than 4096 on Llama-3.1-8B-Instruct bug Something isn't working
#624 opened Oct 18, 2024 by winstxnhdw
2 of 4 tasks
Failed install in nvcr.io/nvidia/tritonserver:24.08-trtllm-python-py3 bug Something isn't working
#623 opened Oct 18, 2024 by wwx007121
4 tasks
make 2 instance.
#617 opened Oct 12, 2024 by Alireza3242
fill_template.py and gpu_device_ids bug Something isn't working
#616 opened Oct 12, 2024 by Alireza3242
2 of 4 tasks
Is ReDrafter supported by the TensorRT-LLM backend? bug Something isn't working
#610 opened Oct 5, 2024 by vkc1vk
2 of 4 tasks
Qwen2-14B inference garbled bug Something isn't working
#601 opened Sep 20, 2024 by kazyun
4 tasks
generation logits dtype bug bug Something isn't working
#598 opened Sep 11, 2024 by binhtranmcs
2 of 4 tasks
Can't build GPT-J 6B bug Something isn't working
#595 opened Sep 6, 2024 by coppock
2 of 4 tasks
Is no_repeat_ngram_size generation option supported? bug Something isn't working
#593 opened Sep 3, 2024 by ghost
2 of 4 tasks
Add Phi 3 vision multimodal support bug Something isn't working
#590 opened Aug 30, 2024 by iibw
ProTip! Exclude everything labeled bug with -label:bug.