You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Autoquant will iterate through a user module and identify all linear dtype + shapes as well as execution time for different quantization routines. This information is baked into the final model output but it is not easily viewable.
We want to add an api for exposing this information.
We should add the option to generate human readable reports for a given autoquant run on a model. This will be useful for identifying strange autoquant behavior - why was quantX chosen over quantY.
The text was updated successfully, but these errors were encountered:
Summary
Autoquant will iterate through a user module and identify all linear dtype + shapes as well as execution time for different quantization routines. This information is baked into the final model output but it is not easily viewable.
We want to add an api for exposing this information.
We should add the option to generate human readable reports for a given autoquant run on a model. This will be useful for identifying strange autoquant behavior - why was quantX chosen over quantY.
The text was updated successfully, but these errors were encountered: