Can we use multi GPU while exporting (diffusers) onnx model? #96

Open
wxsms opened this issue Oct 29, 2024 · 5 comments

wxsms commented Oct 29, 2024

I'm building an SDXL model in float16 on 2x 4090s, so the available GPU memory is ~48 GB.

However, the script in diffusers/quantization does not seem able to use both of them, and it raises an OOM error while exporting the ONNX model.

I tried to export the model on CPU, but it's too slow.
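
For reference, a minimal sketch of the CPU fallback mentioned above, assuming the `backbone` and the `modelopt_export_sd` helper from the diffusers quantization example (the call signature follows that script); tracing the SDXL UNet on CPU avoids the OOM but is very slow:

    import torch

    # Move the backbone off the GPU and trace/export it on CPU instead.
    backbone = backbone.to("cpu").eval()
    with torch.no_grad():
        modelopt_export_sd(backbone, f"{str(args.onnx_dir)}", args.model, args.format)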

jingyu-ml self-assigned this Oct 29, 2024

jingyu-ml (Collaborator) commented Oct 29, 2024

@wxsms could you try something like this?

    backbone.eval()
    with torch.no_grad():  # no gradient state is needed during the ONNX export
        modelopt_export_sd(backbone, f"{str(args.onnx_dir)}", args.model, args.format)

Also move the other parts, such as the VAE and the CLIP text encoders, to CPU. Please let me know if it works.
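
A minimal sketch of that suggestion, assuming the diffusers SDXL pipeline object is called `pipe` (a name introduced here for illustration) and that `backbone`, `args`, and `modelopt_export_sd` come from the quantization script above; the component attribute names are those of the standard diffusers SDXL pipeline:

    import gc
    import torch

    # Keep only the UNet backbone on the GPU; the other components can sit
    # on CPU while the ONNX graph is traced.
    pipe.vae.to("cpu")
    pipe.text_encoder.to("cpu")
    pipe.text_encoder_2.to("cpu")  # SDXL has a second CLIP text encoder
    gc.collect()
    torch.cuda.empty_cache()  # hand the freed blocks back to the driver

    backbone = pipe.unet.to("cuda").eval()
    with torch.no_grad():
        modelopt_export_sd(backbone, f"{str(args.onnx_dir)}", args.model, args.format)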

wxsms (Author) commented Nov 1, 2024

Sadly, it does not work. I managed to export the ONNX model on an A800 and compile it on a 4090.

jingyu-ml (Collaborator) commented

I'll take a look and get back to you; I have barely tested on a 4090. Just to confirm, can you export the FP16 SDXL on a 4090?

wxsms (Author) commented Nov 5, 2024

Thank you, I will try it later.

ZhenshengWu commented Nov 19, 2024

Has there been any progress on this issue? I encountered the same problem on an RTX 4090. Eventually, I performed the ONNX model conversion on an A800. Using nvidia-smi, I noticed that the ONNX conversion process requires around 30 GB of VRAM.

Model: SDXL-1.0
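
A small sketch of how one might confirm that figure from inside the export script itself, assuming the same `backbone`, `args`, and `modelopt_export_sd` names as above (nvidia-smi also counts the CUDA context and allocator overhead, so it will read higher than this number):

    import torch

    torch.cuda.reset_peak_memory_stats()

    backbone.eval()
    with torch.no_grad():
        modelopt_export_sd(backbone, f"{str(args.onnx_dir)}", args.model, args.format)

    # Peak memory actually held in PyTorch tensors during the export.
    print(f"peak allocated: {torch.cuda.max_memory_allocated() / 2**30:.1f} GiB")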
