What model should adjust better to my workflow/needs? #1777

Answered by wsxiaoys
RiQuY asked this question in Q&A

LLM evaluation is actually the most mysterious part of the entire ecosystem: a combination of science and intuition. :)

In general, we receive pretty good feedback on the performance of DeepSeekCoder-6.7B (which tops the leaderboard). We also see mixed feedback when comparing DeepSeekCoder-1.3B and StarCoder-3B. If a particular model feels better in your environment but scores poorly on the leaderboard, it's likely that your working setup is closer to what that model was trained on.

Anyway, our suggestion is to use the leaderboard as a reference and stick with the model that you feel is the best. Lastly, FYI for enterprise or team-wise use cas…

Answer selected by RiQuY