Update on the development branch #2131
Shixiaowei02
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
The TensorRT-LLM team is pleased to announce that we have pushed an update to the development branch (and the Triton backend) this Aug 20, 2024.
This update includes:
use_fused_mlp
toTrue
by defaultLogitsPostProcessorConfig
FinishReason
toResult
remove_input_padding
is enabled #1999)FORCE_NCCL_ALL_REDUCE_STRATEGY
is setperf-overview.md
Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions