Add support for native asymmetric quantization to AQTv2. #725

phoenix-meadowlark · 2024-09-19T19:18:06Z

AQTv2 supports biases and will soon support asymmetric quantization, but only via fake quantization. Supporting native integer asymmetric quantization requires calculating the cross terms in DotGeneralQuantizer (AQTv2's conv and dot_general operation quantizer).

The text was updated successfully, but these errors were encountered:

Integration of native quantization with biases will require computing the cross terms. See [#725](#725) Itemized changes: - Add `IntAsymmetric` to handle asymmetric integer numerics. - this class forgoes some of the more research-y parameters present on `IntSymmetric`. - Add `MinMaxCalibration` to calculate the scale and bias for asymmetric quantization. I additionally tested this change by training MNIST models using `flax_e2e_model`. With symmetric quantization the model fails to converge for `config.config_v4(fwd_bits=2, dlhs_bits=None, drhs_bits=None)` (due to `NaN` losses). With asymmetric quantization the model converges even with `config.config_v4(fwd_bits=2, dlhs_bits=2, drhs_bits=4)`. PiperOrigin-RevId: 651580879

copybara-service bot mentioned this issue Sep 19, 2024

Add support asymmetric fake-quantization to AQTv2. #675

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for native asymmetric quantization to AQTv2. #725

Add support for native asymmetric quantization to AQTv2. #725

phoenix-meadowlark commented Sep 19, 2024

Add support for native asymmetric quantization to AQTv2. #725

Add support for native asymmetric quantization to AQTv2. #725

Comments

phoenix-meadowlark commented Sep 19, 2024