Skip to content

Commit

Permalink
gpt4 turbo -> minimal
Browse files Browse the repository at this point in the history
  • Loading branch information
YannDubs committed Nov 8, 2023
1 parent e3f4821 commit 6537bb7
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion docs/alpaca_eval_gpt4_leaderboard.csv
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
name,win_rate,avg_length,link,samples,filter
GPT-4 Turbo,97.69900497512438,2049,,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/gpt4_turbo/model_outputs.json,community
GPT-4 Turbo,97.69900497512438,2049,,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/gpt4_turbo/model_outputs.json,minimal
XwinLM 70b V0.1,95.56803995,1775,https://github.com/Xwin-LM/Xwin-LM,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/xwinlm-70b-v0.1/model_outputs.json,community
GPT-4,95.27950311,1365,,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/gpt4/model_outputs.json,minimal
LLaMA2 Chat 70B,92.66169154,1790,https://ai.meta.com/llama/,https://github.com/tatsu-lab/alpaca_eval/blob/main/results/llama-2-70b-chat-hf/model_outputs.json,minimal
Expand Down
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
,win_rate,standard_error,n_wins,n_wins_base,n_draws,n_total,mode,avg_length
gpt4_turbo,97.69900497512438,0.5104849118311993,783,16,5,804,community,2049
gpt4_turbo,97.69900497512438,0.5104849118311993,783,16,5,804,minimal,2049
xwinlm-70b-v0.1,95.56803995,0.724941926,765,35,1,801,community,1775
gpt4,95.27950311,0.71628144,761,32,12,805,minimal,1365
llama-2-70b-chat-hf,92.66169154,0.911762258,743,57,4,804,minimal,1790
Expand Down

0 comments on commit 6537bb7

Please sign in to comment.