Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

porting sq to autoround #199

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open

porting sq to autoround #199

wants to merge 4 commits into from

Conversation

n1ck-guo
Copy link
Contributor

  • example
  • api
  • remove duplication process/code, increase running speed
  • evaluation on few models
  • tune alpha / exec order ?

Signed-off-by: n1ck-guo <[email protected]>
@@ -0,0 +1,16 @@
#!/usr/bin/env python
# -*- coding: utf-8 -*-
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

move it to algorithm_ext

absorb_to_layer = self.remove_unsupported_layers(model, absorb_to_layer, no_absorb_layers)
return absorb_to_layer, no_absorb_layers

def remove_unsupported_layers(self, model, absorb_to_layer, no_absorb_layers):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

refer to autoawq to support handcrafted fused patten for popular models and use these configs first. graph trace has some limitations

"alpha_max": 1.0,
"alpha_step": 0.1,
"shared_criterion": "mean",
"n_samples": 32, ##512 for cuda, 128 for cpu?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tune this parameters for gpu

n1ck-guo added 2 commits July 30, 2024 04:20
Signed-off-by: n1ck-guo <[email protected]>
Signed-off-by: n1ck-guo <[email protected]>
@wenhuach21
Copy link
Contributor

Please wait to merge until version 0.3 is released.

Signed-off-by: n1ck-guo <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants