Actions: tatsu-lab/alpaca_eval

alpaca_eval unit tests

10 workflow run results

test leaderboard
alpaca_eval unit tests #225: Pull request #149 synchronize by YannDubs
October 24, 2023 21:12 57s test
test leaderboard
alpaca_eval unit tests #224: Pull request #149 opened by YannDubs
October 24, 2023 21:05 3m 49s test
feat: add test for updating leaderboard
alpaca_eval unit tests #223: Commit afdef27 pushed by YannDubs
October 24, 2023 21:04 4m 53s main
fix: causal LM in models rather than evaluators
alpaca_eval unit tests #222: Commit 0137777 pushed by YannDubs
October 24, 2023 21:00 4m 37s main
Add CausalLM/14B to AlpacaEval (#148)
alpaca_eval unit tests #221: Commit eb3b187 pushed by YannDubs
October 24, 2023 20:57 4m 28s main
Add claude2-alpaca-13b, recycled-wizardlm-7b-v1.0, recycled-wizardlm-…
alpaca_eval unit tests #219: Commit 0b5bc69 pushed by YannDubs
October 23, 2023 06:40 4m 16s main
Add claude2-alpaca-13b, recycled-wizardlm-7b-v1.0, recycled-wizardlm-…
alpaca_eval unit tests #218: Pull request #147 opened by MingLiiii
October 22, 2023 04:57 5m 0s main
fix: neft config
alpaca_eval unit tests #217: Commit 9d0eef9 pushed by YannDubs
October 19, 2023 06:08 4m 57s main
Add NEFTune models to AlpacaEval (#146)
alpaca_eval unit tests #216: Commit a957814 pushed by YannDubs
October 19, 2023 05:57 4m 7s main
Add NEFTune models to AlpacaEval
alpaca_eval unit tests #215: Pull request #146 opened by neelsjain
October 18, 2023 21:33 5m 7s NEFTune-models