Skip to content

Actions: tatsu-lab/alpaca_eval

alpaca_eval unit tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
592 workflow runs
592 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add Zephyr 7B evals (#152)
alpaca_eval unit tests #232: Commit 6c90977 pushed by YannDubs
October 26, 2023 23:11 3m 59s main
October 26, 2023 23:11 3m 59s
Add Evo v2 7B
alpaca_eval unit tests #231: Pull request #153 synchronize by zfang
October 26, 2023 22:50 4m 45s zfang:main
October 26, 2023 22:50 4m 45s
Add Evo v2 7B
alpaca_eval unit tests #230: Pull request #153 opened by zfang
October 26, 2023 22:36 4m 55s zfang:main
October 26, 2023 22:36 4m 55s
Add Zephyr 7B evals
alpaca_eval unit tests #229: Pull request #152 synchronize by lewtun
October 26, 2023 20:03 4m 1s huggingface:add-zephyr
October 26, 2023 20:03 4m 1s
Add Zephyr 7B evals
alpaca_eval unit tests #228: Pull request #152 opened by lewtun
October 26, 2023 17:46 5m 0s huggingface:add-zephyr
October 26, 2023 17:46 5m 0s
Add decoder for calling Anthropic models via Amazon Bedrock
alpaca_eval unit tests #227: Pull request #151 synchronize by billcai
October 25, 2023 11:25 4m 2s billcai:main
October 25, 2023 11:25 4m 2s
Add decoder for calling Anthropic models via Amazon Bedrock
alpaca_eval unit tests #226: Pull request #151 opened by billcai
October 25, 2023 03:00 4m 34s billcai:main
October 25, 2023 03:00 4m 34s
test leaderboard
alpaca_eval unit tests #225: Pull request #149 synchronize by YannDubs
October 24, 2023 21:12 57s test
October 24, 2023 21:12 57s
test leaderboard
alpaca_eval unit tests #224: Pull request #149 opened by YannDubs
October 24, 2023 21:05 3m 49s test
October 24, 2023 21:05 3m 49s
feat: add test for updating leaderboard
alpaca_eval unit tests #223: Commit afdef27 pushed by YannDubs
October 24, 2023 21:04 4m 53s main
October 24, 2023 21:04 4m 53s
fix: causal LM in models rather than evaluators
alpaca_eval unit tests #222: Commit 0137777 pushed by YannDubs
October 24, 2023 21:00 4m 37s main
October 24, 2023 21:00 4m 37s
Add CausalLM/14B to AlpacaEval (#148)
alpaca_eval unit tests #221: Commit eb3b187 pushed by YannDubs
October 24, 2023 20:57 4m 28s main
October 24, 2023 20:57 4m 28s
Add claude2-alpaca-13b, recycled-wizardlm-7b-v1.0, recycled-wizardlm-…
alpaca_eval unit tests #219: Commit 0b5bc69 pushed by YannDubs
October 23, 2023 06:40 4m 16s main
October 23, 2023 06:40 4m 16s
Add claude2-alpaca-13b, recycled-wizardlm-7b-v1.0, recycled-wizardlm-…
alpaca_eval unit tests #218: Pull request #147 opened by MingLiiii
October 22, 2023 04:57 5m 0s main
October 22, 2023 04:57 5m 0s
fix: neft config
alpaca_eval unit tests #217: Commit 9d0eef9 pushed by YannDubs
October 19, 2023 06:08 4m 57s main
October 19, 2023 06:08 4m 57s
Add NEFTune models to AlpacaEval (#146)
alpaca_eval unit tests #216: Commit a957814 pushed by YannDubs
October 19, 2023 05:57 4m 7s main
October 19, 2023 05:57 4m 7s
Add NEFTune models to AlpacaEval
alpaca_eval unit tests #215: Pull request #146 opened by neelsjain
October 18, 2023 21:33 5m 7s NEFTune-models
October 18, 2023 21:33 5m 7s
ProTip! You can narrow down the results and go further in time using created:<2023-10-18 or the other filters available.