Fix (GPxQ): unwrap QuantTensor when dealing with QuantLinear #915
When running GPxQ with quantized activations, once a `QuantLinear` layer is processed, `update_batch` tries to unsqueeze (GPFQ) or transpose (GPTQ) the resulting `QuantTensor`, leading to an error. This PR fixes the issue by unwrapping that tensor. It also adds a quant linear model to the fixtures for reproducing the issue.
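
A minimal sketch of the unwrapping step, assuming it sits at the top of `update_batch` (the helper name `_unwrap` and the exact call sites are illustrative; `QuantTensor.value` is Brevitas's handle on the underlying tensor):

```python
from brevitas.quant_tensor import QuantTensor

def _unwrap(inp):
    """Return the plain torch.Tensor behind a QuantTensor, if any."""
    # After a QuantLinear layer is processed, its output is a QuantTensor;
    # the unsqueeze/transpose calls in update_batch expect a torch.Tensor,
    # so take the dequantized .value first.
    if isinstance(inp, QuantTensor):
        return inp.value
    return inp

# Inside update_batch, GPFQ would then do, e.g.:
#     inp = _unwrap(inp).unsqueeze(0)
# and GPTQ:
#     inp = _unwrap(inp).transpose(0, 1)
```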
Some of the tests still fail because no `ValueError` is raised, since the layer actually has a `quant_input` (here). @i-colbert maybe we need to change the conditions for an expected fail?
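
If it helps, one hypothetical way the expected-failure condition could be loosened (`has_quant_input` and `apply_gpxq` are illustrative stand-ins, not the repo's actual test helpers):

```python
import pytest

def run_gpxq_case(model, has_quant_input, apply_gpxq):
    # Hypothetical condition: only expect a ValueError when the layer has
    # no quant_input; with a quant_input present, the run should succeed
    # after the unwrapping fix.
    if has_quant_input:
        apply_gpxq(model)
    else:
        with pytest.raises(ValueError):
            apply_gpxq(model)
```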