
Feat: improve Brevitas compatibility with torch.compile #785

Closed
wants to merge 16 commits

Conversation

nickfraser (Collaborator)

Crashes occasionally occur when using BREVITAS_JIT=1 together with torch.compile. Ideally, the two should interoperate without crashing until TorchScript is deprecated by upstream PyTorch.

Giuseppe5 marked this pull request as ready for review on August 15, 2024
Giuseppe5 (Collaborator) commented Aug 18, 2024

Current findings from applying compile to quantized models (i.e., models ready for inference):

  • The current structure of QuantTensor is not amenable to compile.
  • Fullgraph compilation seems achievable for layerwise-only quantization with some modifications to the codebase (e.g., removing global variables, adding a zero_zero_point flag to avoid data-dependent control flow, marking QuantTensor with allow_in_graph), but the result seems brittle.
  • Inheriting from another NamedTuple (IntQuantTensorBase or FloatQuantTensorBase) breaks the is_namedtuple check within PyTorch, which could be patched (see the first sketch after this list).
  • Defining a __new__ method (as we do for type checking) is also not supported: at compile time the class instance appears to be passed multiple times within the call to __new__ (see the second sketch after this list).
  • The above problems are currently being investigated (hopefully) here: Cannot override __add__ in NamedTuple with __new__ + torch.compile pytorch/pytorch#133762
  • Even if that were solved, compile does not seem to like torch.bool dtypes for constant values (e.g., zero_zero_point, training, or signed) and prefers plain bool. A torch.bool dtype causes the NamedTuple to decay to a plain tuple, e.g. when adding two QuantTensors.
  • Calling .item() to extract a bool from a torch.bool tensor is partially supported with certain compile flags (see the third sketch after this list).
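
For reference, a minimal sketch (illustrative class names, not actual Brevitas code) of why the inheritance pattern defeats namedtuple detection: PyTorch's structural check (e.g., in torch.utils._pytree) expects a namedtuple class to derive directly from tuple, which a subclass of a NamedTuple base no longer does.

```python
from typing import NamedTuple

import torch


class IntQuantTensorBase(NamedTuple):
    value: torch.Tensor
    scale: torch.Tensor


# Mirrors the pattern described above: a class deriving from a NamedTuple
# base rather than directly from tuple.
class IntQuantTensor(IntQuantTensorBase):
    pass


def is_namedtuple_like(obj) -> bool:
    # Simplified form of the structural check used by torch.utils._pytree:
    # a namedtuple must inherit directly from tuple and define _fields.
    typ = type(obj)
    return typ.__bases__ == (tuple,) and hasattr(typ, "_fields")


print(is_namedtuple_like(IntQuantTensorBase(torch.ones(1), torch.ones(1))))  # True
print(is_namedtuple_like(IntQuantTensor(torch.ones(1), torch.ones(1))))  # False
```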
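
A sketch of the __new__ failure mode, mirroring the pattern reported in the linked issue (untested as a compile repro; it runs fine eagerly):

```python
from typing import NamedTuple

import torch


class QTBase(NamedTuple):
    value: torch.Tensor


class QT(QTBase):
    # Brevitas-style: __new__ defined on a NamedTuple subclass to type-check
    # the inputs. This is the pattern reported as breaking under compile.
    def __new__(cls, value):
        assert isinstance(value, torch.Tensor)
        return super().__new__(cls, value)

    def __add__(self, other):
        return QT(self.value + other.value)


@torch.compile(fullgraph=True)
def add_qt(a, b):
    return a + b


eager = QT(torch.ones(2)) + QT(torch.ones(2))  # works eagerly

# Per pytorch/pytorch#133762, the equivalent compiled call crashes:
# add_qt(QT(torch.ones(2)), QT(torch.ones(2)))
```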
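
On the .item() point, the relevant flag appears to be torch._dynamo.config.capture_scalar_outputs; a minimal sketch, assuming a 0-dim tensor carrying per-tensor metadata:

```python
import torch

# Without this flag, Tensor.item() triggers a graph break (an error under
# fullgraph=True); with it, the scalar becomes an unbacked symbol in the graph.
torch._dynamo.config.capture_scalar_outputs = True


@torch.compile(fullgraph=True)
def rescale(x: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    # Using the extracted scalar in arithmetic compiles; branching on a
    # torch.bool .item() can still raise data-dependent guard errors, which
    # is why this is only partially supported.
    return x * scale.item()


print(rescale(torch.ones(2), torch.tensor(0.5)))
```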

Given the above, my suggestions for this PR:

  • Most likely, this should land post-release.
  • It might be worth getting rid of the training-vs-inference behavior for QuantTensor, to avoid "data-dependent" checks.
  • It might be worth switching to supporting compile while simultaneously dropping support for PyTorch versions below 2.0, to avoid lots of import checks.
  • It might be worth considering a switch to a Tensor subclass, which might be more amenable to compile (see the sketch after this list). This would also mean dropping support for any PyTorch version below 2.0.
  • Alternatively, we could support compile only in export_mode, where we never instantiate QuantTensor at the proxy level and deal only with dequantized values. Doing this in export mode ensures we don't need to propagate QuantTensors, since everything is cached (e.g., metadata for bias quantization).
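
On the Tensor-subclass option, a minimal sketch (hypothetical, not a proposed implementation) of the __torch_function__-based direction; as noted above, this would require PyTorch >= 2.0:

```python
import torch


class QuantTensorSketch(torch.Tensor):
    """Hypothetical sketch: quantization metadata carried on a Tensor subclass."""

    @staticmethod
    def __new__(cls, value: torch.Tensor, scale: torch.Tensor):
        # Wrap the (dequantized) value; metadata rides along as an attribute.
        out = value.as_subclass(cls)
        out.scale = scale
        return out

    @classmethod
    def __torch_function__(cls, func, types, args=(), kwargs=None):
        # Fall back to plain Tensor behaviour; a real implementation would
        # propagate scale/zero-point through the ops it cares about.
        return super().__torch_function__(func, types, args, kwargs or {})


qt = QuantTensorSketch(torch.randn(4), torch.tensor(0.05))
out = torch.relu(qt)  # dispatches through __torch_function__
print(type(out), qt.scale)
```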

nickfraser closed this on Sep 12, 2024
Labels: next release (PRs which should be merged for the next release)