Hi everyone.
I'm not up to date, and also inexperienced, so please forgive my ignorance.
Llama.cpp is a really impressive and useful system, but it does not use all of the available cores, even on a very large system (100+ cores).
I'm asking you all: why is this the case? Have you evaluated or considered using the SequenceL interpreting compiler to generate massively threaded C++, or is the real bottleneck memory bandwidth?
Thanks everyone and thanks Grigory!
-Richard