batch support #726
When using the `Llama` object, the completion functions don't appear to support batching: passing a list of prompts errors out on encode.
However, the same function in OpenAI's API does allow batching (though if you attempt it against the server version of llama.cpp's OpenAI-compatible proxy, it errors out as well).
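For reference, a minimal sketch of the two call styles being compared. The model path, model name, and prompts are placeholders, not from the original report; the per-prompt loop is just a client-side workaround, not a batched call:

```python
from llama_cpp import Llama
from openai import OpenAI

prompts = ["Q: What is 2 + 2? A:", "Q: Name a primary color. A:"]

# llama-cpp-python's Llama.create_completion() takes a single string
# prompt, so a list of prompts has to be looped over client-side.
llm = Llama(model_path="./models/model.gguf")  # placeholder path
results = [llm.create_completion(p, max_tokens=16) for p in prompts]

# OpenAI's completions endpoint accepts a list of prompts in one
# request and returns one choice per prompt.
client = OpenAI()
batch = client.completions.create(
    model="gpt-3.5-turbo-instruct",
    prompt=prompts,  # batched: single request, len(prompts) choices
    max_tokens=16,
)
```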
Just cleaning up old issues and closing this in favour of #771
Batching works with OpenAI's API endpoint, but when attempting the same request against llama-cpp-python's API endpoint, I can use a single prompt string, not a list of prompts.
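A sketch of what that comparison looks like with the OpenAI client pointed at a local llama-cpp-python server; the base URL, API key, and model name are assumptions for illustration:

```python
from openai import OpenAI

# Assumes the llama-cpp-python server is running locally, e.g.:
#   python -m llama_cpp.server --model ./models/model.gguf
client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="sk-no-key-needed",  # the local server doesn't check keys
)

# A single string prompt works against the local endpoint...
ok = client.completions.create(model="model", prompt="Hello", max_tokens=16)

# ...but a list of prompts, which the real OpenAI endpoint accepts,
# is reported here to error out against the llama-cpp-python server.
fails = client.completions.create(
    model="model", prompt=["Hello", "World"], max_tokens=16
)
```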