Porting SmoothQuant (SQ) to AutoRound #199
base: main
Conversation
n1ck-guo commented on Jul 24, 2024:
- example
- api
- remove duplicated process/code to increase running speed
- evaluation on a few models
- tune alpha / exec order? (see the smoothing sketch below)
Signed-off-by: n1ck-guo <[email protected]>
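For context, here is a minimal sketch of the SmoothQuant transform this PR ports, assuming a LayerNorm-into-Linear absorb pair. The scale formula follows the SmoothQuant paper; the helper names (`smooth_scales`, `absorb_into_layernorm`) are illustrative, not this PR's actual API:

```python
import torch

def smooth_scales(act_max: torch.Tensor, weight: torch.Tensor, alpha: float = 0.5) -> torch.Tensor:
    """Per-input-channel SmoothQuant scales: s_j = max|X_j|^alpha / max|W_j|^(1-alpha)."""
    w_max = weight.abs().amax(dim=0)  # per-input-channel weight max, shape [in_features]
    return (act_max.pow(alpha) / w_max.pow(1.0 - alpha)).clamp(min=1e-5)

@torch.no_grad()
def absorb_into_layernorm(ln: torch.nn.LayerNorm, linear: torch.nn.Linear, scales: torch.Tensor) -> None:
    # Fold 1/s into the preceding LayerNorm and s into the Linear weight, so the
    # end-to-end function is unchanged while activation outliers are smoothed.
    ln.weight.div_(scales)
    if ln.bias is not None:
        ln.bias.div_(scales)
    linear.weight.mul_(scales)  # broadcasts over the in_features (last) dim
```

Tuning alpha would then sweep it over a grid (cf. the `alpha_step`/`alpha_max` config hunk below) and keep the value that minimizes quantization error.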
auto_round/smooth_quant/__init__.py (Outdated)
@@ -0,0 +1,16 @@
#!/usr/bin/env python
# -*- coding: utf-8 -*-
move it to algorithm_ext
absorb_to_layer = self.remove_unsupported_layers(model, absorb_to_layer, no_absorb_layers)
return absorb_to_layer, no_absorb_layers

def remove_unsupported_layers(self, model, absorb_to_layer, no_absorb_layers):
Refer to AutoAWQ to support handcrafted fused patterns for popular models, and use those configs first; graph tracing has some limitations.
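For illustration, a rough sketch of an AutoAWQ-style handcrafted pattern registry, as suggested above. The module names match common HF Llama/OPT layouts, but the registry shape, `ABSORB_PATTERNS`, and the `trace_absorb_pairs` fallback are hypothetical, not code from this PR or AutoAWQ:

```python
# Handcrafted absorb patterns: map a model type to (prev layer -> layers that
# can absorb its smoothing scales). Checked before falling back to graph trace.
ABSORB_PATTERNS = {
    "llama": [
        ("input_layernorm", ["self_attn.q_proj", "self_attn.k_proj", "self_attn.v_proj"]),
        ("post_attention_layernorm", ["mlp.gate_proj", "mlp.up_proj"]),
    ],
    "opt": [
        ("self_attn_layer_norm", ["self_attn.q_proj", "self_attn.k_proj", "self_attn.v_proj"]),
        ("final_layer_norm", ["fc1"]),
    ],
}

def get_absorb_pattern(model):
    """Prefer a handcrafted pattern; otherwise fall back to graph tracing."""
    model_type = getattr(getattr(model, "config", None), "model_type", None)
    if model_type in ABSORB_PATTERNS:
        return ABSORB_PATTERNS[model_type]
    return trace_absorb_pairs(model)  # hypothetical trace-based fallback
```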
"alpha_max": 1.0, | ||
"alpha_step": 0.1, | ||
"shared_criterion": "mean", | ||
"n_samples": 32, ##512 for cuda, 128 for cpu? |
Tune these parameters for GPU.
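A minimal sketch of making `n_samples` device-dependent, per the review comment. The 512/128 values come from the inline question in the diff, and `alpha_min` is an assumption (it is not shown in this hunk), so treat all of these as starting points to tune rather than validated defaults:

```python
import torch

# Starting points echoing the inline question ("512 for cuda, 128 for cpu?").
DEFAULT_N_SAMPLES = {"cuda": 512, "cpu": 128}

def make_tune_config(device=None):
    device = device or ("cuda" if torch.cuda.is_available() else "cpu")
    return {
        "alpha_min": 0.0,  # assumption: alpha_min is not shown in this hunk
        "alpha_max": 1.0,
        "alpha_step": 0.1,
        "shared_criterion": "mean",
        "n_samples": DEFAULT_N_SAMPLES.get(device, 128),
    }
```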
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
Please hold off on merging until version 0.3 is released.
Signed-off-by: n1ck-guo <[email protected]>