You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed that the rounding method uses round(), this results in values ranging from -128 to 128, rather than -128 to 127. So maybe is not a 8bit quant in some cases, at least I found this problem on llama3. I guess this has something to do with the distribution of parameters.
Can you give me some advice?
The text was updated successfully, but these errors were encountered:
I noticed that the rounding method uses round(), this results in values ranging from -128 to 128, rather than -128 to 127. So maybe is not a 8bit quant in some cases, at least I found this problem on llama3. I guess this has something to do with the distribution of parameters.
Can you give me some advice?
The text was updated successfully, but these errors were encountered: