Replies: 1 comment 3 replies
-
For pth, we currently don't store bit width as part of the state dict with the majority of the pre-defined quantizers (although it is possible to do it if you create a custom one). I am not sure I understand what you mean with comparing each other though. |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I have trained a NN model with Brevitas & Torch by using QAT for W1A1, W2A2, W4A4, W8,A8 variations. Then I saved them as PTH and ONNX formats. I want to determine quantization level of the models by just using the exported files. Is it possible? Is there any direct method instead of comparing each other?
Thanks in advance,
Beta Was this translation helpful? Give feedback.
All reactions