10.7.0 GA
For more information, see the TensorRT 10.7.0 release notes.
Key Features and Updates:
- Demo Changes
  - demoDiffusion
    - Enabled low-vram for the Flux pipeline. Users can now run the pipelines on systems with 32GB VRAM.
    - Added support for FLUX.1-schnell pipeline.
    - Enabled weight streaming mode for Flux pipeline.
- Plugin Changes
  - On Blackwell and later platforms, TensorRT will drop cuDNN support for the following categories of plugins:
    - User-written `IPluginV2Ext`, `IPluginV2DynamicExt`, and `IPluginV2IOExt` plugins that depend on cuDNN handles provided by TensorRT (via the `attachToContext()` API).
    - TensorRT standard plugins that use cuDNN, specifically:
      - `InstanceNormalization_TRT` (versions 1, 2, and 3), present in `plugin/instanceNormalizationPlugin/`.
      - `GroupNormalizationPlugin` (version 1), present in `plugin/groupNormalizationPlugin/`.
      - Note: These normalization plugins are superseded by TensorRT’s native `INormalizationLayer` (C++, Python). TensorRT support for cuDNN-dependent plugins remains unchanged on pre-Blackwell platforms.
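For anyone moving off `InstanceNormalization_TRT`, the following is a minimal sketch of how the native `INormalizationLayer` could be added through the Python API. It assumes a rank-4 or higher input in N, C, spatial layout, that instance normalization reduces over the spatial dimensions only, and that `scale` and `bias` are tensors already shaped to broadcast against the input (for example `[1, C, 1, 1]`). The helper names `instance_norm_axes_mask` and `add_instance_norm` are hypothetical and not part of TensorRT; only `add_normalization` and the layer's `epsilon` attribute are TensorRT API.

```python
def instance_norm_axes_mask(rank: int) -> int:
    """Return a bitmask with one bit set per spatial axis (axes 2..rank-1).

    Hypothetical helper: assumes layout (N, C, spatial...), so axis 0 is the
    batch axis and axis 1 is the channel axis; neither is reduced over.
    """
    if rank < 3:
        raise ValueError("instance normalization needs at least one spatial axis")
    mask = 0
    for axis in range(2, rank):
        mask |= 1 << axis
    return mask


def add_instance_norm(network, x, scale, bias, rank: int, epsilon: float = 1e-5):
    """Add a native normalization layer in place of the cuDNN-based plugin.

    `network` is expected to be a tensorrt.INetworkDefinition; `x`, `scale`
    and `bias` are expected to be tensorrt.ITensor objects. This is a sketch
    under the assumptions stated above, not a drop-in replacement.
    """
    layer = network.add_normalization(x, scale, bias, instance_norm_axes_mask(rank))
    layer.epsilon = epsilon
    return layer


if __name__ == "__main__":
    # For an NCHW tensor the reduced axes are H (2) and W (3).
    print(bin(instance_norm_axes_mask(4)))
```

The bitmask helper is kept separate so it can be checked without a GPU or a TensorRT installation. Whether the result matches the plugin numerically for a given model should be verified, since epsilon defaults and precision handling may differ.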
- Parser Changes
  - Now prioritizes using plugins over local functions when a corresponding plugin is available in the registry.
  - Added dynamic axes support for `Squeeze` and `Unsqueeze` operations.
  - Added support for parsing mixed-precision `BatchNormalization` nodes in strongly-typed mode.
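As background for the `Squeeze` and `Unsqueeze` item, here is a small pure-Python sketch of the shape rules those ONNX operators follow when `axes` is an ordinary list of integers that may only be known at run time. It assumes "dynamic axes" refers to axes supplied as a non-constant input, which the note above does not spell out. The function names are hypothetical, and this illustrates operator semantics only, not how the parser implements them.

```python
def unsqueeze_shape(shape, axes):
    """Insert size-1 dimensions at `axes`, indexed in the OUTPUT rank."""
    out_rank = len(shape) + len(axes)
    # Negative axes count from the end of the output shape.
    norm = [a + out_rank if a < 0 else a for a in axes]
    if len(set(norm)) != len(norm):
        raise ValueError("duplicate axes")
    if any(a < 0 or a >= out_rank for a in norm):
        raise ValueError("axis out of range")
    remaining = iter(shape)
    return [1 if i in norm else next(remaining) for i in range(out_rank)]


def squeeze_shape(shape, axes):
    """Remove the size-1 dimensions at `axes`, indexed in the INPUT rank."""
    rank = len(shape)
    # Negative axes count from the end of the input shape.
    norm = {a + rank if a < 0 else a for a in axes}
    for a in norm:
        if not 0 <= a < rank:
            raise ValueError("axis out of range")
        if shape[a] != 1:
            raise ValueError(f"cannot squeeze axis {a} with size {shape[a]}")
    return [d for i, d in enumerate(shape) if i not in norm]


if __name__ == "__main__":
    print(unsqueeze_shape([3, 4], [0, -1]))
    print(squeeze_shape([1, 3, 1, 4], [0, -2]))
```

Note the asymmetry: `Unsqueeze` resolves negative axes against the output rank, while `Squeeze` resolves them against the input rank.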
- Addressed Issues
  - Fixed 4113.