Using candle-vllm as crate in rust? #62

gkvoelkl · 2024-07-20T10:29:07Z

Hi Eric, great rust programm.

I am looking for a crate so I can use a chatbot function within my rust programm. I tried to to that with candle. I hope it will be more documented in den future.

Will it be possible to call a function of candle-vllm without starting an explicit server? So I can use candle-vllm within my programm.

Thanks

Best regards Gerhard

EricLBuehler · 2024-07-24T19:11:09Z

Hi @gkvoelkl! Candle-vllm is a great option: you can see an example of how to build such a chatbot here in pure Rust: openai_server.rs.

I would also recommend that you check out mistral.rs as it not only has PagedAttention but Metal, Cpu, vision model, adapter models, quantization, and a plethora of other features including a crate which is meant for usage in an application (docs, examples).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using candle-vllm as crate in rust? #62

Using candle-vllm as crate in rust? #62

gkvoelkl commented Jul 20, 2024

EricLBuehler commented Jul 24, 2024 •

edited

Loading

Using candle-vllm as crate in rust? #62

Using candle-vllm as crate in rust? #62

Comments

gkvoelkl commented Jul 20, 2024

EricLBuehler commented Jul 24, 2024 • edited Loading

EricLBuehler commented Jul 24, 2024 •

edited

Loading