Skip to content

Keep Getting Different Outputs With The Same Seed #6275

Closed Answered by GrennKren
GrennKren asked this question in Q&A
Discussion options

You must be logged in to vote

Instead of using AWQ, I discovered that I can load the model without quantization using a Transformer loader.

This actually works out better since I can choose to run the model in full precision, 8-bit, or 4-bit.
For 8-bit or 4-bit, you just need to add the --load-in-4bit or --load-in-8bit flag.

So yeah, problem solved 🤍

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@GrennKren
Comment options

@GrennKren
Comment options

Answer selected by GrennKren
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant