
Use device as context manager for init_on_device #1826

Merged: 1 commit, Aug 10, 2023

Conversation

@shingjan (Contributor) commented Aug 8, 2023

fixes #1814

cc: @sgugger

@HuggingFaceDocBuilderDev commented Aug 8, 2023

The documentation is not available anymore as the PR was closed or merged.

@shingjan force-pushed the shingjan/use_device branch 3 times, most recently from b488347 to ece578c, on August 9, 2023 at 00:27
@sgugger (Collaborator) left a comment


Thanks for drafting this! I left a couple of comments; mainly, we can't change the default of include_buffers, which would break backward compatibility.
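For context, the backward-compatible shape of the change can be sketched roughly as follows. This is a hand-written sketch, not the actual accelerate implementation: the helper names (`init_on_device_sketch`, `_torch_at_least_2_0`) are illustrative, and only the general dispatch idea (keep `include_buffers=False` as the default; with torch >= 2.0 and buffers included, delegate to the device context manager) reflects the PR.

```python
import torch
import torch.nn as nn
from contextlib import contextmanager


def _torch_at_least_2_0() -> bool:
    # Crude version check for illustration; accelerate has its own helper.
    major, minor = torch.__version__.split(".")[:2]
    return (int(major), int(minor.split("+")[0])) >= (2, 0)


@contextmanager
def init_on_device_sketch(device: torch.device, include_buffers: bool = False):
    # include_buffers keeps its old default (False), so existing callers
    # see no behavior change.
    if include_buffers and _torch_at_least_2_0():
        # On torch >= 2.0 a torch.device is itself a context manager:
        # modules built inside get parameters AND buffers on that device.
        with device:
            yield
        return

    # Legacy path: patch register_parameter so only parameters are
    # created on the target device; buffers are left alone.
    old_register_parameter = nn.Module.register_parameter

    def register_parameter(module, name, param):
        old_register_parameter(module, name, param)
        if param is not None:
            param_cls = type(module._parameters[name])
            module._parameters[name] = param_cls(
                module._parameters[name].to(device),
                requires_grad=param.requires_grad,
            )

    try:
        nn.Module.register_parameter = register_parameter
        yield
    finally:
        nn.Module.register_parameter = old_register_parameter
```

Restoring the original `register_parameter` in a `finally` block matters: it keeps the monkey-patch scoped to the `with` body even if module construction raises.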

src/accelerate/big_modeling.py: 3 review comments (outdated, resolved)
tests/test_big_modeling.py: 1 review comment (outdated, resolved)
@BenjaminBossan (Member) left a comment


Thanks for the PR. Is it possible to have a test for this, or would it be too complicated to set up because of the version dependency?

src/accelerate/big_modeling.py: 2 review comments (outdated, resolved)
@shingjan force-pushed the shingjan/use_device branch 4 times, most recently from 4afad01 to 7237f9a, on August 9, 2023 at 21:42
@shingjan (Contributor, Author) commented Aug 9, 2023

Thanks @sgugger and @BenjaminBossan for the reviews!

I added a test with include_buffers=True to make sure that, when running with torch >= 2.0, we go through the device context manager. This PR, however, won't fix the issue I had before:

with tensor_mode():
  model = transformers.from_pretrained(...)

since init_empty_weights(...) uses include_buffers=False by default. I wonder if you have more context on why include_buffers=False is the default for init_empty_weights. If the ultimate goal is to move an nn.Module to the meta device in order to save CPU/CUDA memory, wouldn't buffers also be included in the state dict of an nn.Module, and therefore be better to move to meta as well? Thanks again!
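For reference, the torch >= 2.0 behavior that the include_buffers=True path relies on can be checked directly: a plain torch.device used as a context manager redirects the creation of both parameters and buffers. A minimal check (assumes torch >= 2.0 is installed; not code from the PR):

```python
import torch
import torch.nn as nn

# On torch >= 2.0, torch.device works as a context manager: every tensor
# created inside, parameters and buffers alike, lands on that device.
with torch.device("meta"):
    bn = nn.BatchNorm1d(8)

print(bn.weight.device.type)        # parameter -> meta
print(bn.running_mean.device.type)  # buffer    -> meta
```

Since both the parameter and the buffer end up on the meta device, no real memory is allocated for either.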

@sgugger (Collaborator) commented Aug 10, 2023

We use include_buffers=False because buffers are usually not very heavy, and it ends up taking more time to create them on the meta device and then move them around than to just create them directly on the GPU.
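Both halves of that tradeoff are easy to verify with plain PyTorch (a small illustrative example, not from the PR): buffers do appear in an nn.Module's state dict, but for a module dominated by weight matrices they account for a small fraction of the elements.

```python
import torch.nn as nn

model = nn.Sequential(nn.Linear(512, 512), nn.BatchNorm1d(512))

# Buffers (running_mean, running_var, num_batches_tracked) do appear in
# the state dict alongside parameters...
assert "1.running_mean" in model.state_dict()

# ...but they are a small fraction of the total element count, which is
# why creating them directly on the target device is cheap.
param_elems = sum(p.numel() for p in model.parameters())
buffer_elems = sum(b.numel() for b in model.buffers())
print(param_elems, buffer_elems)  # 263680 1025
```

Here the buffers are roughly 0.4% of the parameter count, so skipping them in init_empty_weights costs little memory while avoiding the overhead of materializing and moving them.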

@sgugger (Collaborator) left a comment


Nice new test!

@sgugger sgugger merged commit 058a354 into huggingface:main Aug 10, 2023
24 checks passed
@shingjan (Contributor, Author) commented:
Thanks @sgugger and @BenjaminBossan for getting this PR in!

Successfully merging this pull request may close these issues.

big_modeling.init_on_device not working with WrapperTensor class like FakeTensor