This repository has been archived by the owner on Dec 6, 2023. It is now read-only.
Mistral 7B Instruct - "cannot parse response" after one or two responses #142
Labels: bug
To replicate the regression bug (maybe it is time to have an end-to-end test that runs automatically):
curl --location 'http://localhost:8447/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data '{
"model": "mistral-7b-instruct-v0.1.Q5_0.gguf",
"messages": [
{
"role": "user",
"content": "explain Bitcoin like I am 5"
}
],
"stream": true,
"temperature": 0.2,
"max_tokens": 256,
"top_p": 0.95,
"frequency_penalty": 0,
"n": 1,
"presence_penalty": 0
}'
curl --location 'http://localhost:8447/v1/chat/completions' \
--header 'Content-Type: application/json' \
--data '{
"model": "mistral-7b-instruct-v0.1.Q5_0.gguf",
"messages": [
{
"role": "user",
"content": "do it with emoji"
}
],
"stream": true,
"temperature": 0.2,
"max_tokens": 256,
"top_p": 0.95,
"frequency_penalty": 0,
"n": 1,
"presence_penalty": 0
}'
Response:
event: completion
data: {"id": "chatcmpl-d8676dd6-9320-4eb1-ae97-0ef8ad6f7754", "model": "mistral-7b-instruct-v0.1.Q5_0.gguf", "created": 1700658362, "object": "chat.completion.chunk", "choices": [{"index": 0, "delta": {"role": "assistant"}, "finish_reason": null}]}
event: completion
data: {"id": "chatcmpl-d8676dd6-9320-4eb1-ae97-0ef8ad6f7754", "model": "mistral-7b-instruct-v0.1.Q5_0.gguf", "created": 1700658362, "object": "chat.completion.chunk", "choices": [{"index": 0, "delta": {}, "finish_reason": "stop"}]}
event: done
data: [DONE]
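For the end-to-end test mentioned above, a minimal sketch of the check might look like the following. It only parses a captured stream and collects the text from each chunk's `delta`; it does not talk to the server. The names `collect_content` and `EMPTY_STREAM` are hypothetical, and the sample is a shortened form of the response pasted above (the `id`, `model`, and `created` fields are omitted). The assumption is that a healthy reply carries its text in `delta.content`, as in the OpenAI-style chunk format shown.

```python
import json

# Shortened copy of the stream from this issue: a role-only delta,
# then an immediate "stop", with no content chunk in between.
EMPTY_STREAM = """\
event: completion
data: {"object": "chat.completion.chunk", "choices": [{"index": 0, "delta": {"role": "assistant"}, "finish_reason": null}]}
event: completion
data: {"object": "chat.completion.chunk", "choices": [{"index": 0, "delta": {}, "finish_reason": "stop"}]}
event: done
data: [DONE]
"""


def collect_content(sse_text):
    """Concatenate delta.content from every `data:` line of an SSE stream."""
    parts = []
    for line in sse_text.splitlines():
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip `event:` lines and blank lines
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel, not JSON
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            content = choice.get("delta", {}).get("content")
            if content:
                parts.append(content)
    return "".join(parts)


if __name__ == "__main__":
    text = collect_content(EMPTY_STREAM)
    # An automated test could fail whenever the collected text is empty.
    print("empty completion" if not text else text)
```

A real test would feed this function the body returned by the two curl calls and fail when the collected text is empty.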
On the second call I got this response:
Interesting. I assume you are using in-process Python to run it, right? So the packaging (i.e. PyInstaller?) may be the reason for the divergence?
Tried again with cht-llama-cpp-mistral-1-aarch64-apple-darwin, but got a similar response 🤔 Can you try on a clean download, maybe?
See prem-research/app#514