How to run a llama.cpp server with the FastChat API #16

Open
xx-zhang opened this issue Jun 27, 2023 · 1 comment
Comments

@xx-zhang

I have set up the server, but it outputs only a few words and then appears to block. It also runs as a single process that cannot respond quickly: the model is only loaded when a request arrives.
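For reference, FastChat's standard serving stack is three processes (a controller, a model worker, and an OpenAI-compatible API server); serving a llama.cpp/GGUF model would additionally require a worker backend that can load it, which is not shown here. Below is a minimal sketch of querying such a stack, assuming a stock FastChat install with default ports; the model path, model name, and ports are illustrative, not taken from this issue:

```python
# Sketch: querying FastChat's OpenAI-compatible API server.
# Assumes the stack was started roughly as follows (illustrative paths/ports):
#   python3 -m fastchat.serve.controller
#   python3 -m fastchat.serve.model_worker --model-path /path/to/model
#   python3 -m fastchat.serve.openai_api_server --host 0.0.0.0 --port 8000
import requests

resp = requests.post(
    "http://localhost:8000/v1/chat/completions",  # illustrative host/port
    json={
        "model": "vicuna-7b-v1.3",  # illustrative model name
        "messages": [{"role": "user", "content": "Hello, are you responsive?"}],
        "max_tokens": 64,
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```

If responses stall after a few tokens, the worker process and its model loading are the usual places to look, which is presumably why the setup details are requested below.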

@fredi-python
Member

How did you set up the server?
