Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Revert "Fix parse matmulinteger 611" #2933

Closed

Conversation

causten
Copy link
Collaborator

@causten causten commented Mar 29, 2024

Reverts #2922

Ted Themistokleous and others added 21 commits March 22, 2024 22:46
Originally we were using dot and converting the input for MatMulinteger() as it was parsed in.  As a result we were inserting dot() instead of quant_dot into the model.

I've added a change to check for similar data types and then add a shift to an input  where there's a mismatch of int8 and uint8 on input. This is usually seen when dynamicquantizelinear is used as the input as it inserts a uint8 as the zero point which in some models (like gpt2) is operated on.
Handle tidy complexity error and break up functionality into seperate calls
Not needed and its flagged by our parse tests. Quant dot should by default output int32 for the accumulated output
- Add additional checks for input on supported types
- Simplify add_int8_shit to check for not same supported 8-bit args
- Allow for pass through for bias for quants
-Remove extra layer of ifs for broadcast dims + dot/quant_dot output
…se_matmul

Handle the following cases during parse

- one bias (with broadcasts)
- two bias (with broadcasts)
- mismatch between zp and input
- unsupported type

Added cases to gen_onnx for error input
@causten
Copy link
Collaborator Author

causten commented Mar 29, 2024

do not merge, this will be deleted by me

Copy link

codecov bot commented Mar 29, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 91.47%. Comparing base (c6f4a18) to head (1ef8851).

Additional details and impacted files
@@                  Coverage Diff                  @@
##           release/rocm-rel-6.1    #2933   +/-   ##
=====================================================
  Coverage                 91.46%   91.47%           
=====================================================
  Files                       468      468           
  Lines                     17542    17519   -23     
=====================================================
- Hits                      16045    16025   -20     
+ Misses                     1497     1494    -3     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@causten causten force-pushed the release/rocm-rel-6.1 branch from c6f4a18 to 05a7707 Compare March 29, 2024 14:54
@causten causten closed this Apr 1, 2024
@causten causten deleted the revert-2922-fix_parse_matmulinteger_611 branch April 1, 2024 16:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants