Replies: 2 comments 3 replies
-
Likely a misconfiguration issue. I'm using Qwen 2.5 32B daily without any issues. |
Beta Was this translation helpful? Give feedback.
3 replies
-
Is your binary up to date, few weeks ago crash was linked to kv cache quantization + defrag |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I've try it with Qwen 2.5 32B/72B, perfect for single turn Q&A, but in multi-turn conversation, it got some bugs, like core dumped, or after normal output for a while, it start output random strings, some times it end, sometimes not.
For "random strings output bug", I can make a repeatable case, just wonder is it worth?
I'm use Open Web UI as frontend, and the bug won't occurs when switch to aphrodite-engine.
I've try K/V cache with default(f16) q8_0 q4_0, all of them had "random strings output bug", the later two had "core dumped bug"
Beta Was this translation helpful? Give feedback.
All reactions