TRT support for MAISI #8153

Status: Open — wants to merge 121 commits into base: dev
Conversation

@borisfom (Contributor) commented Oct 16, 2024

Description

Added trt_compile() support for lists and tuples in the arguments of forward(), which is needed for MAISI.
Support for grouping return results is not added yet; MAISI worked with an explicit workaround that unrolls the return results.
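The list/tuple unrolling described above can be sketched in plain Python (a minimal illustration with hypothetical names and string stand-ins for tensors; not the actual trt_compile() implementation):

```python
# Sketch: flatten list/tuple forward() arguments into individually named
# inputs, the way an ONNX exporter needs them. Naming scheme is assumed.
def unroll_inputs(input_names, input_example):
    unrolled = {}
    for name in input_names:
        val = input_example[name]
        if isinstance(val, (list, tuple)):
            # a list/tuple argument becomes name_0, name_1, ... flat inputs
            for i, item in enumerate(val):
                unrolled[f"{name}_{i}"] = item
        else:
            unrolled[name] = val
    return unrolled

example = {"x": "tensor_x", "skips": ["s0", "s1", "s2"]}
print(unroll_inputs(["x", "skips"], example))
# {'x': 'tensor_x', 'skips_0': 's0', 'skips_1': 's1', 'skips_2': 's2'}
```

A matching regrouping step on the output side is what the "explicit workaround unrolling the return results" avoids for now.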

Notes

To successfully export MAISI, either the latest Torch nightly is needed, or this patch needs to be applied to a 24.09-based container:

--- /usr/local/lib/python3.10/dist-packages/torch/onnx/symbolic_opset14.bak     2024-10-09 01:38:04.920316673 +0000                                                   
+++ /usr/local/lib/python3.10/dist-packages/torch/onnx/symbolic_opset14.py      2024-10-09 01:38:25.228053951 +0000                                                   
@@ -148,7 +148,6 @@                                                                                                                                                   
         is_causal and symbolic_helper._is_none(attn_mask)                                                                                                            
     ), "is_causal and attn_mask cannot be set at the same time"                                                                                                      
                                                                                                                                                                      
-    scale = symbolic_helper._maybe_get_const(scale, "f")                                                                                                             
     if symbolic_helper._is_none(scale):                                                                                                                              
         scale = _attention_scale(g, query)                                                                                                                           

Types of changes

  • Non-breaking change (fix or new feature that would not break existing functionality).

borisfom and others added 30 commits August 4, 2024 23:17
Signed-off-by: Boris Fomitchev <[email protected]>
@borisfom (Contributor, Author) commented:

Also, I did not do any results verification. If any results depend on Meta tensors operation, that part may be lost. Please check!

monai/apps/generation/maisi/networks/controlnet_maisi.py (outdated; resolved)
Dockerfile (outdated)
@@ -11,7 +11,7 @@

# To build with a different base image
# please run `docker build` using the `--build-arg PYTORCH_IMAGE=...` flag.
-ARG PYTORCH_IMAGE=nvcr.io/nvidia/pytorch:24.08-py3
+ARG PYTORCH_IMAGE=nvcr.io/nvidia/pytorch:24.09-py3
Contributor:
May need more test for this base image update.

Contributor (author):

Well, it does not make a real difference (the patch I mentioned in the description is needed for 24.09 anyway), so I may revert this one for now, too. 24.10 (and 2.5.0) won't require the exporter patch.

Contributor:

Does this mean we'll need to update to version 24.10 once it's released, since 24.09 still doesn't meet the requirements and MAISI still lacks TRT support?
I tried to update the base image and trigger more tests in PR #8164, which showed the error below:
#8164 (comment)

Contributor (author):

Yes, I believe it's better to skip 24.09, as it still requires a patch.

@@ -693,7 +695,7 @@ def convert_to_onnx(
f = io.BytesIO()
Contributor:

Could you please also modify this part based on the latest API of torch.onnx.export? Thanks!
#8149 (comment)

@KumoLiu (Contributor) commented Oct 22, 2024:

Hi @binliunls, please also review the trt related parts in this PR, thanks.

@KumoLiu mentioned this pull request Oct 22, 2024 (7 tasks)
@@ -255,6 +345,7 @@ def __init__(
'torch_trt' may not work for some nets. Also AMP must be turned off for it to work.
input_names: Optional list of input names. If None, will be read from the function signature.
output_names: Optional list of output names. Note: If not None, patched forward() will return a dictionary.
output_lists: Optional list of output lists.
Contributor:

Could you please add more details about this parameter and its relation to output_names? Right now its meaning is hard to understand.

Thanks,
Bin
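For context, one plausible reading of the parameter (a hypothetical sketch only, not the PR's confirmed semantics): output_names label the flat engine outputs, while output_lists could record how many consecutive flat outputs should be regrouped into each returned list. In pure Python:

```python
# Hypothetical sketch: regroup flat engine outputs back into lists.
# 'output_lists' is assumed here to give the length of each returned
# list (0 meaning a plain, un-grouped output). Not the actual API.
def regroup_outputs(flat_outputs, output_lists):
    results, i = [], 0
    for n in output_lists:
        if n == 0:
            results.append(flat_outputs[i])
            i += 1
        else:
            results.append(flat_outputs[i : i + n])
            i += n
    return results

print(regroup_outputs(["a", "b", "c", "d"], [0, 3]))
# ['a', ['b', 'c', 'd']]
```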

@@ -233,13 +321,15 @@ def __init__(
method="onnx",
input_names=None,
output_names=None,
output_lists=None,
Contributor:

Could you please also add a simple unit test case to showcase how to use this parameter?

Thanks,
Bin

self._build_and_save(model, build_args)
# This will reassign input_names from the engine
build_args = args.copy()
with torch.no_grad():
Contributor:

May I ask the reason for adding torch.no_grad() here? Did it cause some issues in the previous version?

Thanks,
Bin

Contributor (author):

Yes, there were some issues with export. Since TRT is inference-only, it makes sense to do the whole export under torch.no_grad(); this is the recommended way.

@@ -180,7 +184,8 @@ def try_set_inputs():
raise
self.cur_profile = next_profile
ctx.set_optimization_profile_async(self.cur_profile, stream)

except Exception:
raise
Contributor:

Could you please add more info to explain this exception?
Thanks,
Bin

Contributor (author):

This exception would be raised when trying to set input shapes for which the engine was not built. Previously I had logic there that rotated the TRT optimization profile index on such an exception; we do not use multiple profiles with MONAI, so I should probably simplify the code.
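The profile-rotation fallback described above can be illustrated in plain Python (a stand-in engine object, no TensorRT; the class and helper names are hypothetical):

```python
# Illustrative sketch of rotating through optimization profiles until one
# covers the requested input shape. FakeEngine stands in for a TRT engine.
class FakeEngine:
    def __init__(self, profiles):
        # each profile is a (min_shape, max_shape) pair
        self.profiles = profiles

    def fits(self, profile_idx, shape):
        lo, hi = self.profiles[profile_idx]
        return all(a <= s <= b for a, s, b in zip(lo, shape, hi))

def pick_profile(engine, cur_profile, shape):
    """Try the current profile first, then rotate through the others."""
    n = len(engine.profiles)
    for k in range(n):
        idx = (cur_profile + k) % n
        if engine.fits(idx, shape):
            return idx
    raise RuntimeError(f"No optimization profile covers shape {shape}")

eng = FakeEngine([((1, 64), (4, 256)), ((1, 256), (4, 1024))])
print(pick_profile(eng, 0, (2, 512)))  # falls through to profile 1
```

With a single profile, as noted above, the loop degenerates to one attempt, which is why the rotation logic can be simplified away.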

# Simulate list/tuple unrolling during ONNX export
unrolled_input = {}
for name in input_names:
val = input_example[name]
Contributor:

I think input_example.get(name, None) is a better choice here, in case there are any illegal keys.

Thanks,
Bin
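The suggested defensive lookup could look like this (a minimal sketch with a hypothetical helper name, skipping missing keys rather than raising):

```python
# Sketch of the suggested change: use dict.get(name, None) so that
# illegal or missing keys are tolerated instead of raising KeyError.
def unroll_input(input_names, input_example):
    unrolled = {}
    for name in input_names:
        val = input_example.get(name, None)
        if val is None:
            continue  # skip names absent from the example
        unrolled[name] = val
    return unrolled

print(unroll_input(["x", "bogus"], {"x": 1}))
# {'x': 1}
```

One caveat with this pattern: silently skipping a misspelled input name can hide bugs, so a warning on the skip may be preferable.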

Contributor (author):

Yes, we can look more into making this robust for the odd cases.

@@ -41,6 +41,10 @@ RUN cp /tmp/requirements.txt /tmp/req.bak \
COPY LICENSE CHANGELOG.md CODE_OF_CONDUCT.md CONTRIBUTING.md README.md versioneer.py setup.py setup.cfg runtests.sh MANIFEST.in ./
COPY tests ./tests
COPY monai ./monai

# TODO: remove this line and torch.patch for 24.11
RUN patch -R -d /usr/local/lib/python3.10/dist-packages/torch/onnx/ < ./monai/torch.patch
Contributor:

Seems the patch is not included in 24.10, right?

Contributor (author):

Correct, the proper fix is not included in 24.10, so we have to patch.

4 participants