
[Torch FX] Post Quantize Weights Compression #2984

Conversation

@anzr299 (Contributor) commented Sep 24, 2024

### Changes

A transformation that removes fake quantize nodes and saves all weights to disk in int8 format after quantization. It works as follows:

  1. Pattern-match the quantize-dequantize (QDQ) node pairs.
  2. Filter the matches to keep only quantize-dequantize ops with a constant input.
  3. Reshape the scale if the QDQ operation is per-channel.
  4. Replace the matched subgraph with a multiplication of the int8 weight and the scale.
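The steps above can be sketched roughly as follows. This is an illustrative sketch only: NumPy stands in for the torch.fx graph machinery, and the function names are hypothetical, not the actual NNCF implementation.

```python
import numpy as np

def compress_weight(weight: np.ndarray, scale: np.ndarray, axis: int = 0):
    """Quantize a float weight to int8 and return (int8_weight, broadcastable_scale).

    In the per-channel case `scale` holds one value per output channel and is
    reshaped so it broadcasts along `axis` during decompression (step 3).
    """
    if scale.ndim > 0 and scale.size > 1:  # per-channel: reshape for broadcasting
        shape = [1] * weight.ndim
        shape[axis] = -1
        scale = scale.reshape(shape)
    q = np.clip(np.round(weight / scale), -128, 127).astype(np.int8)
    return q, scale

def decompress_weight(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    # The quantize-dequantize pair is replaced by this single multiplication (step 4).
    return q.astype(np.float32) * scale

# Toy per-channel example: one scale per output channel (row).
w = np.array([[0.5, -1.0], [2.0, 4.0]], dtype=np.float32)
s = np.array([0.01, 0.05], dtype=np.float32)
q, s_r = compress_weight(w, s, axis=0)
w_dec = decompress_weight(q, s_r)
```

The weight is stored in int8; at inference time only the multiplication by the scale remains in the graph.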

### Reason for changes

To compress the model after quantization.

### Tests

Added `test_post_quantization_compression()` in `tests/torch/fx/test_model_transformer.py`, which checks the data type of all weights in the model after applying quantization and verifies the values after the decompression step (the element-wise multiplication operation).
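As a rough illustration of what such a test verifies (NumPy stand-ins for a compressed model's constants; values are made up, not taken from the real test):

```python
import numpy as np

# Stand-ins for a compressed weight and its scale after the transformation.
int8_weight = np.array([[50, -100], [40, 80]], dtype=np.int8)
scale = np.array([[0.01], [0.05]], dtype=np.float32)
original = np.array([[0.5, -1.0], [2.0, 4.0]], dtype=np.float32)

# 1) Every stored weight must be int8 after the transformation.
assert int8_weight.dtype == np.int8

# 2) Decompression (element-wise multiplication) must recover the
#    original values within quantization tolerance.
decompressed = int8_weight.astype(np.float32) * scale
assert np.allclose(decompressed, original)
```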

### Tickets

#2766

@github-actions github-actions bot added NNCF PT Pull requests that updates NNCF PyTorch experimental labels Sep 24, 2024
@daniil-lyakhov (Collaborator) left a comment:
LGTM

@daniil-lyakhov (Collaborator) left a comment:

Minor

@alexsu52 (Contributor) left a comment:

LGTM

@alexsu52 alexsu52 merged commit 7c94b23 into openvinotoolkit:develop Oct 21, 2024
14 checks passed
alexsu52 pushed a commit that referenced this pull request Oct 30, 2024
### Changes

* ~~Constant folding is applied to all TorchFX models before quantization~~
* Some torchvision models (swin_v2_s, vit_16_b) are exported by
`torch.export.export` before the OV conversion
* MOC transformations are applied to OpenVINO compressed models after
the compression

After #2984:
* Fixed `_compress_qdq_constant_transformation` for per tensor case
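The per-tensor fix comes down to the distinction sketched below; NumPy stands in for torch, and `reshape_scale` is an illustrative helper, not the actual `_compress_qdq_constant_transformation` code:

```python
import numpy as np

def reshape_scale(scale: np.ndarray, weight_ndim: int, axis: int = 0) -> np.ndarray:
    """Per-channel scales must be reshaped to broadcast over the weight;
    a per-tensor scale is a single scalar and needs no reshaping."""
    if scale.size == 1:        # per-tensor case: leave the scalar as-is
        return scale
    shape = [1] * weight_ndim  # per-channel case: align with the channel axis
    shape[axis] = -1
    return scale.reshape(shape)

per_tensor = reshape_scale(np.array(0.02, dtype=np.float32), weight_ndim=2)
per_channel = reshape_scale(np.array([0.01, 0.05], dtype=np.float32), weight_ndim=2)
```

Treating the per-tensor scalar like a per-channel array (and reshaping it) is the kind of mismatch such a fix guards against.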

### Reason for changes

* To align TorchFX/OV quantized models

### Related tickets

#2766

### Tests

The post_training_quantization/504/ build finished successfully.
Labels: Code Freeze, experimental, NNCF PT