
TensorRT OSS v10.7.0

@kevinch-nv kevinch-nv released this 05 Dec 21:16
17003e4

10.7.0 GA

For more information, see the TensorRT 10.7.0 release notes.

Key Features and Updates:

  • Demo Changes

    • demoDiffusion
      • Enabled low-vram mode for the Flux pipeline, which can now run on systems with 32 GB of VRAM.
      • Added support for FLUX.1-schnell pipeline.
      • Enabled weight streaming mode for Flux pipeline.
  • Plugin Changes

    • On Blackwell and later platforms, TensorRT will drop cuDNN support for the following categories of plugins:
      • User-written IPluginV2Ext, IPluginV2DynamicExt, and IPluginV2IOExt plugins that are dependent on cuDNN handles provided by TensorRT (via the attachToContext() API).
      • TensorRT standard plugins that use cuDNN, specifically:
        • InstanceNormalization_TRT (version: 1, 2, and 3) present in plugin/instanceNormalizationPlugin/.
        • GroupNormalizationPlugin (version: 1) present in plugin/groupNormalizationPlugin/.
        • Note: These normalization plugins are superseded by TensorRT’s native INormalizationLayer (C++, Python). TensorRT support for cuDNN-dependent plugins remains unchanged on pre-Blackwell platforms.
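
For readers migrating off InstanceNormalization_TRT, the following is a minimal sketch of how an instance-normalization step might be expressed with the native INormalizationLayer through the Python API. It is not taken from the release itself. The helper names axes_mask and add_instance_norm are hypothetical, and the sketch assumes an NCHW input tensor, float32 NumPy scale and bias arrays of shape (1, C, 1, 1), and an existing INetworkDefinition passed in as network.

```python
def axes_mask(axes):
    """Return a bitmask with bit i set for every axis index i in `axes`."""
    mask = 0
    for axis in axes:
        if axis < 0:
            raise ValueError("axis indices must be non-negative")
        mask |= 1 << axis
    return mask


def add_instance_norm(network, x, scale, bias, epsilon=1e-5):
    """Append an instance-normalization step to `network` (hypothetical helper).

    Assumptions (not stated in the release notes):
      * `network` is a tensorrt.INetworkDefinition.
      * `x` is an ITensor in NCHW layout.
      * `scale` and `bias` are float32 NumPy arrays of shape (1, C, 1, 1);
        the caller keeps them alive until the engine has been built.
    """
    # Wrap the per-channel parameters as constant tensors.
    scale_t = network.add_constant(scale.shape, scale).get_output(0)
    bias_t = network.add_constant(bias.shape, bias).get_output(0)

    # Instance norm reduces over the spatial axes only (H=2, W=3 in NCHW).
    layer = network.add_normalization(x, scale_t, bias_t, axes_mask([2, 3]))
    layer.epsilon = epsilon
    return layer.get_output(0)
```

The mask helper keeps the axis selection readable: for NCHW input, axes_mask([2, 3]) sets bits 2 and 3, so normalization statistics are computed per sample and per channel over H and W. A group-normalization replacement would need different scale and bias shapes plus the layer's num_groups setting, which this sketch does not cover.
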
  • Parser Changes

    • The parser now prioritizes plugins over local functions when a corresponding plugin is available in the registry.
    • Added dynamic axes support for Squeeze and Unsqueeze operations.
    • Added support for parsing mixed-precision BatchNormalization nodes in strongly-typed mode.
  • Addressed Issues