
fix dispatch for quantized model #1855

Merged: 1 commit merged into huggingface:main on Aug 17, 2023

Conversation

@SunMarc (Member) commented on Aug 17, 2023

What does this do?

This PR fixes the dispatch function for quantized models. Only bnb (bitsandbytes) models need hooks in a single-GPU setup; for other quantization methods, such as GPTQ, hooks are not needed. A sketch of this logic follows.
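To illustrate the behavior described above, here is a minimal, hypothetical sketch of the dispatch guard. It is not the code changed by this PR: the helper `dispatch_quantized_model` and the `quantization_method` argument are assumptions made for the example; only `accelerate.dispatch_model` is a real library entry point.

```python
# Hypothetical sketch -- not the actual accelerate source. The helper name
# `dispatch_quantized_model` and the `quantization_method` argument are
# illustrative assumptions; only `accelerate.dispatch_model` is a real API.
from accelerate import dispatch_model


def dispatch_quantized_model(model, device_map, quantization_method):
    """Attach device hooks only when they are actually needed.

    In a single-device setup, only bitsandbytes (bnb) quantized models need
    hooks; other methods such as GPTQ can be placed on the device directly.
    """
    single_device = len(set(device_map.values())) == 1

    if single_device and quantization_method != "bnb":
        # GPTQ (and similar) on one device: move the model directly, no hooks.
        device = next(iter(device_map.values()))
        return model.to(device)

    # bnb on a single device, or any multi-device map: use hook-based dispatch
    # so tensors are routed to the right device at forward time.
    return dispatch_model(model, device_map=device_map)
```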

@younesbelkada (Contributor) left a comment


Makes sense, thanks Marc!

@SunMarc merged commit 21d1273 into huggingface:main on Aug 17, 2023
24 checks passed
@SunMarc deleted the fix_dispatch_quantized_model branch on August 17, 2023 at 16:23
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

4 participants