You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks very much for the great work. I find that the W4A4 contains group, then the gemm accumulation may not be executed inside the tensor core using int32 accumulator, may I ask how the performance of this method compares to W8A8 without groups? Could you please provide some statistics ?
Thanks very much
The text was updated successfully, but these errors were encountered:
Thanks very much for the great work. I find that the W4A4 contains group, then the gemm accumulation may not be executed inside the tensor core using int32 accumulator, may I ask how the performance of this method compares to W8A8 without groups? Could you please provide some statistics ?
Thanks very much
The text was updated successfully, but these errors were encountered: