I've noticed that the new `convert_checkpoint.py` scripts (such as the one for llama) have some quantization options built in, though others, like FP8, are missing. The README for llama suggests using `quantize.py` for FP8 post-training quantization instead.
Is there any reason these two are split?
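
For context, here is a minimal sketch of the two paths as I understand them from the llama example README. Flags and script locations may differ across TensorRT-LLM versions, and the model/output paths are placeholders:

```bash
# Path 1: convert_checkpoint.py handles some quantization modes directly,
# e.g. INT8 weight-only (./llama-7b-hf is a hypothetical local HF checkpoint).
python convert_checkpoint.py \
    --model_dir ./llama-7b-hf \
    --output_dir ./ckpt-int8-wo \
    --dtype float16 \
    --use_weight_only \
    --weight_only_precision int8

# Path 2: FP8 PTQ is not available here; the README points to the separate
# quantize.py script, which runs calibration and emits an FP8 checkpoint.
python ../quantization/quantize.py \
    --model_dir ./llama-7b-hf \
    --output_dir ./ckpt-fp8 \
    --dtype float16 \
    --qformat fp8 \
    --kv_cache_dtype fp8
```

Both produce a checkpoint that is then passed to `trtllm-build`, so from the user's side they look like two entry points to the same step.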