
HQO for scale/zero point #937

Merged: 3 commits into Xilinx:dev on Sep 12, 2024
Conversation

@Giuseppe5 (Collaborator) commented Apr 17, 2024

Implement HQO optimization for scale and zero point.
Caveats:

  • Scale does not seem to perform that well
  • Zero point supports per_tensor/per_channel/per_group (asym only)
  • Scale supports per_tensor/per_channel
  • More numerical tests are needed to evaluate the impact of HQO compared to other techniques
  • Scale implementation is INT-only
  • Zero point implementation does not make sense for FP8 since we mostly do symmetric quantization

Maybe worth merging after #1002 and expanding tests.
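For context, "HQO" here presumably refers to half-quadratic optimization of the quantization parameters. A minimal standalone sketch of what optimizing an asymmetric zero point under a fixed scale might look like is below; `hqo_zero_point`, the closed-form l2 update, and all names are an illustration of the idea, not the PR's actual code:

```python
import torch

def hqo_zero_point(w, scale, bit_width=8, iters=20):
    """Hypothetical sketch of half-quadratic-style optimization of an
    asymmetric zero point, assuming a fixed per-row scale. Illustration
    only; not Brevitas' implementation."""
    qmin, qmax = 0.0, 2.0 ** bit_width - 1.0
    zero_point = torch.zeros_like(scale)
    for _ in range(iters):
        # quantize with the current zero point
        q = torch.clamp(torch.round(w / scale + zero_point), qmin, qmax)
        # closed-form update: argmin_zp ||w - scale * (q - zp)||^2 per row
        zero_point = (q - w / scale).mean(dim=-1, keepdim=True)
    # note: a real asymmetric quantizer would round the zero point to int
    return zero_point
```

Alternating a hard quantization step with a closed-form parameter update is the general shape of half-quadratic methods; per-group support would follow by reshaping `w` so each group is a row.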

# fall back to the cached internal candidate wherever the new candidate is NaN
candidate[torch.isnan(candidate)] = self.internal_candidate[torch.isnan(candidate)]
candidate = self.clamp_min_ste(candidate)  # straight-through clamp keeps it positive
bit_width = self.msb_clamp_bit_width_impl()
int_threshold = self.int_scaling_impl(bit_width)  # integer range for this bit width
@Giuseppe5 (Collaborator, Author) Apr 17, 2024
This is an int-specific implementation; maybe it should be generalized to also cover the minifloat case.
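A standalone rendering of the int-specific path the comment points at might look as follows; this is a reconstruction assuming `int_threshold = 2**(bit_width - 1)` for a signed integer range, with names mirroring the excerpt but not taken from Brevitas:

```python
import torch

def int_scale_from_candidate(candidate, internal_candidate, bit_width=8, eps=1e-8):
    """Hypothetical sketch: turn a learned threshold candidate into an
    integer-quantization scale. Not Brevitas' actual module."""
    # replace NaN entries in the learned candidate with the cached internal one
    nan_mask = torch.isnan(candidate)
    candidate = torch.where(nan_mask, internal_candidate, candidate)
    # keep the threshold strictly positive (stands in for clamp_min_ste)
    candidate = candidate.clamp_min(eps)
    # the 2**(bit_width - 1) range is the int-only assumption being discussed;
    # a minifloat representation would imply a different threshold here
    int_threshold = 2.0 ** (bit_width - 1)
    return candidate / int_threshold
```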

@nickfraser (Collaborator) Sep 4, 2024

> maybe it should be generalized to also cover minifloat case

This looks like it hasn't been resolved to me. Is that true?

src/brevitas/quant/base.py (outdated; resolved)
@nickfraser (Collaborator) left a review comment:

Is it intentional that this only works for Int?

candidate[torch.isnan(candidate)] = self.internal_candidate[torch.isnan(candidate)]
candidate = self.clamp_min_ste(candidate)
bit_width = self.msb_clamp_bit_width_impl()
int_threshold = self.int_scaling_impl(bit_width)

@Giuseppe5 Giuseppe5 requested review from nickfraser and removed request for nickfraser September 12, 2024 13:36
@nickfraser nickfraser merged commit 3a9bcc6 into Xilinx:dev Sep 12, 2024
372 of 374 checks passed
Labels: next release (PRs which should be merged for the next release)