
Serialize Config from Model #7

Merged
merged 14 commits into from
Apr 16, 2024

Conversation

@Satrat (Contributor) commented Apr 12, 2024

  • Added a from_pretrained method to QuantizationConfig that creates a config from a model by iterating through its QuantizationSchemes, simplifying the ignore list as much as possible
  • Added a helper function for calculating the global compression ratio of a config
  • Added a unit test that serializes a full config
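The core idea behind from_pretrained can be sketched as follows: walk the model's leaf modules, group layer names by the quantization scheme attached to them, and send unquantized layers to the ignore list. This is a hedged, self-contained sketch, not the actual sparsetensors implementation; the input shape (name, scheme) pairs and the function name are illustrative assumptions.

```python
# Illustrative sketch only: group submodule names by their quantization
# scheme so a config can be reconstructed from a live model. The real
# sparsetensors code iterates leaf modules and reads scheme objects;
# here schemes are stood in for by hashable values (e.g. strings).
from collections import defaultdict

def build_scheme_groups(named_schemes):
    """named_schemes: iterable of (layer_name, scheme_or_None) pairs.

    Returns (groups, ignore) where groups maps each scheme to the list
    of layer names using it, and ignore lists unquantized layers.
    """
    groups = defaultdict(list)
    ignore = []
    for name, scheme in named_schemes:
        if scheme is None:
            # no scheme attached -> layer is excluded from quantization
            ignore.append(name)
        else:
            groups[scheme].append(name)
    return dict(groups), ignore
```

For example, a model whose attention and MLP layers share one W8A8-style scheme while the head is skipped would yield a single scheme group plus an ignore entry for the head.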

# From QuantizationConfig.from_pretrained: accumulators for the schemes,
# status, and ignored layers found while walking the model's leaf modules
quant_scheme_to_layers = []
quantization_status = None
ignore = []
for name, submodule in iter_named_leaf_modules(model):
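The PR description also mentions a helper for the global compression ratio of a config. One reasonable reading, sketched below under stated assumptions, is total parameter bits at full precision divided by total bits under the quantized layout; the function name and argument shapes are hypothetical, not the actual sparsetensors helper.

```python
# Hedged sketch: global compression ratio as full-precision parameter
# bits divided by bits under the quantized config. Layers missing from
# bit_widths are treated as unquantized (kept at full precision).
def compression_ratio(param_counts, bit_widths, full_bits=32):
    """param_counts: {layer_name: num_params}
    bit_widths:   {layer_name: quantized bits per param}
    Returns the ratio of uncompressed to compressed storage."""
    total_full = sum(n * full_bits for n in param_counts.values())
    total_quant = sum(
        n * bit_widths.get(layer, full_bits)
        for layer, n in param_counts.items()
    )
    return total_full / total_quant
```

Under these assumptions, quantizing every layer from 32-bit to 8-bit weights yields a ratio of 4.0, and a model with no quantized layers yields 1.0.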

See the TODO comment about allowing exceptions for observers in leaf nodes. This will be relevant for non-frozen quantized models.

src/sparsetensors/quantization/utils/helpers.py (review thread, outdated; resolved)
Base automatically changed from apply-config to main April 15, 2024 15:13
@Satrat changed the title from [DRAFT] WIP for serializing config to Serialize Config from Model on Apr 16, 2024
@Satrat marked this pull request as ready for review April 16, 2024 13:52
@bfineran merged commit edc35a1 into main Apr 16, 2024
2 checks passed
@bfineran deleted the serialize_config branch April 16, 2024 19:30