
[Bug] Using the new 4-bit quantized models of internlm2, the decoded word starts with a blank. #2651

Open

zhulinJulia24 opened this issue Oct 24, 2024 · 0 comments
Checklist

  • 1. I have searched related issues but cannot get the expected help.
  • 2. The bug has not been fixed in the latest version.
  • 3. Please note that if the bug-related issue you submitted lacks corresponding environment info and a minimal reproducible demo, it will be challenging for us to reproduce and resolve the issue, reducing the likelihood of receiving feedback.

Describe the bug

[Screenshot: output showing that the string decoded from the quantized model's tokenizer begins with an extra leading blank]

Reproduction

  1. Quantize the model with transformers>=4.45.0:

lmdeploy lite auto_awq internlm/internlm2_5-7b-chat --work-dir internlm2_5-7b-chat-inner-4bits --batch-size 32

  2. Run the following script:

from transformers import AutoTokenizer

# Load the tokenizer from the quantized model's work directory
tokenizer = AutoTokenizer.from_pretrained('/nvme/qa_test_models/internlm/internlm2_5-7b-chat-inner-4bits', trust_remote_code=True)

# Decode a single token id, then re-encode the decoded string;
# the decoded string unexpectedly starts with a blank
decoded = tokenizer.decode(2)
print(decoded)
print(tokenizer.encode(decoded))
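
For comparison, a minimal sketch (assuming access to the Hugging Face Hub and that internlm/internlm2_5-7b-chat is the original, unquantized model this checkpoint was produced from) that decodes the same token id with both tokenizers, to check whether the leading blank is introduced by the quantization step:

from transformers import AutoTokenizer

# Tokenizer shipped with the original (unquantized) model
base_tok = AutoTokenizer.from_pretrained('internlm/internlm2_5-7b-chat', trust_remote_code=True)
# Tokenizer from the local 4-bit quantized work directory
quant_tok = AutoTokenizer.from_pretrained('/nvme/qa_test_models/internlm/internlm2_5-7b-chat-inner-4bits', trust_remote_code=True)

for name, tok in [('base', base_tok), ('quantized', quant_tok)]:
    decoded = tok.decode(2)
    # repr() makes a leading space visible in the printed output
    print(name, repr(decoded), tok.encode(decoded))

If only the quantized tokenizer's output begins with a space, the regression is in the files written to the work directory rather than in the tokenizer itself.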

Environment

transformers>=4.45.0

Error traceback

No response
