Add support for asymmetric fake-quantization to AQTv2. #675

copybara-service · 2024-07-23T02:57:59Z

Add support for asymmetric fake-quantization to AQTv2.

Integration of native quantization with biases will require computing the cross terms. See #725
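
For readers unfamiliar with the cross terms mentioned above, here is a minimal sketch of where they come from. It assumes each operand is dequantized as `scale * q + bias` with one scale and one bias per tensor; this PR does not spell out the exact form, and all names below are hypothetical rather than AQT APIs. Expanding the product of two such operands gives one integer dot product plus three terms that involve the biases.

```python
def dot(a, b):
    """Plain dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def dot_with_cross_terms(q_lhs, q_rhs, s_lhs, b_lhs, s_rhs, b_rhs):
    """Dot product of (s_lhs*q_lhs + b_lhs) and (s_rhs*q_rhs + b_rhs).

    Hypothetical helper: assumes per-tensor scale and bias on both sides.
    """
    k = len(q_lhs)                 # contraction length
    int_dot = dot(q_lhs, q_rhs)    # the only term that needs an integer matmul
    sum_lhs = sum(q_lhs)           # feeds the lhs-values x rhs-bias cross term
    sum_rhs = sum(q_rhs)           # feeds the lhs-bias x rhs-values cross term
    return (s_lhs * s_rhs * int_dot
            + s_lhs * b_rhs * sum_lhs
            + b_lhs * s_rhs * sum_rhs
            + k * b_lhs * b_rhs)


# Illustrative 2-bit codes and made-up scales/biases.
q_lhs, q_rhs = [0, 3, 1, 2], [2, 2, 0, 1]
s_lhs, b_lhs = 0.5, -1.0
s_rhs, b_rhs = 0.25, 0.5

# Reference: dequantize first, then take the float dot product.
deq_lhs = [s_lhs * q + b_lhs for q in q_lhs]
deq_rhs = [s_rhs * q + b_rhs for q in q_rhs]
reference = dot(deq_lhs, deq_rhs)
via_cross_terms = dot_with_cross_terms(q_lhs, q_rhs, s_lhs, b_lhs, s_rhs, b_rhs)
```

With symmetric quantization both biases are zero and only the first term survives, which is why fake-quantization can be added without touching the integer matmul path.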

Itemized changes:

- Add `IntAsymmetric` to handle asymmetric integer numerics.
  - This class forgoes some of the more research-y parameters present on `IntSymmetric`.
- Add `MinMaxCalibration` to calculate the scale and bias for asymmetric quantization.
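
As a rough illustration of the two items above, the sketch below shows one common way to do min-max calibration followed by asymmetric fake-quantization. It is not the code in this PR: the function names, the unsigned code range `[0, 2**bits - 1]`, and the choice of `bias = min` are assumptions made for the example, and the actual `MinMaxCalibration` and `IntAsymmetric` classes may differ.

```python
def min_max_calibrate(values, bits):
    """Return (scale, bias) mapping [min, max] onto the codes 0 .. 2**bits - 1.

    Hypothetical helper; assumes bias is the tensor minimum.
    """
    levels = 2 ** bits - 1
    lo, hi = min(values), max(values)
    scale = (hi - lo) / levels
    if scale == 0.0:
        scale = 1.0  # constant input: any non-zero scale avoids division by zero
    return scale, lo


def fake_quant_asymmetric(values, bits):
    """Quantize to integer codes, then dequantize straight back to floats."""
    scale, bias = min_max_calibrate(values, bits)
    levels = 2 ** bits - 1
    out = []
    for v in values:
        q = round((v - bias) / scale)    # nearest integer code
        q = max(0, min(levels, q))       # clip to the representable range
        out.append(q * scale + bias)     # back to the original float range
    return out


x = [0.0, 0.4, 1.0, 3.0]
y = fake_quant_asymmetric(x, bits=2)  # y == [0.0, 0.0, 1.0, 3.0]
```

Because the grid is anchored at the observed minimum and maximum, all `2**bits` codes fall inside the data range, including for inputs that are not centered on zero.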

I additionally tested this change by training MNIST models using `flax_e2e_model`. With symmetric quantization the model fails to converge for `config.config_v4(fwd_bits=2, dlhs_bits=None, drhs_bits=None)` (due to `NaN` losses). With asymmetric quantization the model converges even with `config.config_v4(fwd_bits=2, dlhs_bits=2, drhs_bits=4)`.

PiperOrigin-RevId: 651580879

copybara-service bot force-pushed the test_651580879 branch 4 times, most recently from 0071d1a to 33de2e9 (July 23, 2024 19:23)

copybara-service bot force-pushed the test_651580879 branch from 33de2e9 to b47fe35 (August 20, 2024 19:46)

copybara-service bot force-pushed the test_651580879 branch 2 times, most recently from ef818dc to cef287a (August 29, 2024 23:29)

copybara-service bot force-pushed the test_651580879 branch from cef287a to 097249a (September 9, 2024 22:36)

copybara-service bot force-pushed the test_651580879 branch 9 times, most recently from b066596 to 4694998 (September 23, 2024 22:50)

copybara-service bot force-pushed the test_651580879 branch from 4694998 to e4fa804 (September 27, 2024 22:36)

copybara-service bot force-pushed the test_651580879 branch from e4fa804 to ba94cf8 (October 4, 2024 22:00)
