cohere update #155
Conversation
sanderland commented Nov 2, 2023 (edited)
- Refreshes Cohere outputs to reflect the most recent model
- Uses the new `max_tokens=None` feature in our API to avoid unneeded truncation
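The kwargs change above can be sketched as follows. This is a minimal illustration, not the evaluator's actual implementation: `build_generation_kwargs` is a hypothetical helper, and the `model_name` → `model` key mapping is an assumption about how the config keys translate to API parameters. The point it shows is that a YAML `null` arrives as Python `None` and is passed through rather than replaced with a fixed token budget.

```python
def build_generation_kwargs(completions_kwargs: dict) -> dict:
    """Hypothetical helper: turn config kwargs into API call kwargs.

    A ``max_tokens`` of None (``null`` in YAML) is passed through
    unchanged, so generation runs up to EOS or the model's context
    length instead of truncating at a fixed budget like 2048.
    """
    kwargs = dict(completions_kwargs)
    # Assumed mapping from the config key to the API parameter name.
    kwargs["model"] = kwargs.pop("model_name")
    return kwargs


# Old config truncated at 2048 tokens; the new one leaves the limit unset.
old_config = {"model_name": "command-nightly", "max_tokens": 2048}
new_config = {"model_name": "command-nightly", "max_tokens": None}
```

With `new_config`, the resulting kwargs carry `max_tokens=None`, which is the signal to generate until EOS or the context limit.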
fn_completions: "cohere_completions"
completions_kwargs:
  model_name: "command-nightly"
-  max_tokens: 2048
+  max_tokens: null # up to EOS or context length
what's the context length?
Our current context length is 4096
@@ -5,6 +5,7 @@ llama-2-70b-chat-hf,92.66169154,0.911762258,743,57,4,804,minimal,1790
 ultralm-13b-v2.0-best-of-16,92.29813665,0.940299807,743,62,0,805,community,1720
 xwinlm-13b-v0.1,91.76029963,0.968139439,734,65,2,801,community,1894
 ultralm-13b-best-of-16,91.54228856,0.981927769,736,68,0,804,community,1980
+cohere,91.49068322981367,0.9781229071866879,735,67,3,805,community,2012
impressive jump, is the model updated or is it because you removed the context length limit (the model output seems 300 characters longer)?
This is a new model. The context length limit change gives a small additional bump (small enough that it could just be evaluator noise), but it's also the currently recommended way to call our models.
Impressive results @sanderland, is it on purpose that the PR is marked as draft?
@YannDubs just double checking, should be all ready now :)