This repository has been archived by the owner on Dec 6, 2023. It is now read-only.
We can start with the Mistral binaries. For example, instead of returning a generic "Load failed", we should surface the underlying ValueError shown below:
```
Traceback (most recent call last):
  File "starlette/responses.py", line 273, in wrap
  File "starlette/responses.py", line 262, in stream_response
  File "routes.py", line 64, in generate_chunk_based_response
  File "llama_cpp/llama.py", line 1540, in _convert_text_completion_chunks_to_chat
  File "llama_cpp/llama.py", line 947, in _create_completion
ValueError: Requested tokens (3308) exceed context window of 512
```
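One possible shape for this: wrap the completion call, and when llama-cpp-python raises a ValueError (as in the traceback above), pass its message through to the client instead of a generic "Load failed". This is only a minimal sketch; `safe_generate` and its parameters are hypothetical names, not the project's actual API, and the real route would return an HTTP error response rather than a dict.

```python
# Hypothetical sketch: translate the specific ValueError into a structured
# error payload instead of swallowing it as a generic "Load failed".
# `safe_generate`, `create_completion`, `prompt_tokens`, and `n_ctx` are
# illustrative names, not identifiers from this repository.

def safe_generate(create_completion, prompt_tokens: int, n_ctx: int = 512):
    """Run a completion callable, surfacing context-window errors verbatim."""
    try:
        # Pre-check mirrors the error llama_cpp raises in _create_completion.
        if prompt_tokens > n_ctx:
            raise ValueError(
                f"Requested tokens ({prompt_tokens}) exceed context window of {n_ctx}"
            )
        return {"ok": True, "result": create_completion()}
    except ValueError as exc:
        # Keep the specific message so the client knows what went wrong.
        return {"ok": False, "error": str(exc)}
```

In the streaming route, the `error` string could then be sent back as the response body (e.g. with a 400 status) so the user sees the context-window message instead of "Load failed".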