Actions: stanford-crfm/helm

Test

3,197 workflow runs

GPQA Few-shot CoT, adapter part
Test #7482: Pull request #3099 synchronize by liamjxu
October 31, 2024 04:05 13m 10s jialiang/gpqa_cot_adapter
Add Llama-Omni-8B (#3119)
Test #7481: Commit 5bb2f59 pushed by ImKeTT
October 31, 2024 03:15 13m 2s main
Add Llama-Omni-8B
Test #7480: Pull request #3119 synchronize by ImKeTT
October 31, 2024 03:01 13m 14s ImKeTT:audiollms-v2
Add stop sequence support to MistralClient
Test #7479: Pull request #3120 opened by yifanmai
October 31, 2024 03:01 15m 49s yifanmai/fix-mistral-stop
Add Llama-Omni-8B
Test #7478: Pull request #3119 synchronize by ImKeTT
October 31, 2024 02:24 13m 37s ImKeTT:audiollms-v2
Add Llama-Omni-8B
Test #7477: Pull request #3119 opened by ImKeTT
October 31, 2024 01:39 4m 49s ImKeTT:audiollms-v2
GPQA Few-shot CoT, adapter part
Test #7476: Pull request #3099 synchronize by liamjxu
October 31, 2024 00:28 5m 6s jialiang/gpqa_cot_adapter
GPQA Few-shot CoT, adapter part
Test #7475: Pull request #3099 synchronize by liamjxu
October 30, 2024 23:00 32m 28s jialiang/gpqa_cot_adapter
Add IEMOCAP and MELD scenarios
Test #7474: Pull request #3113 opened by yifanmai
October 30, 2024 20:31 28m 55s yifanmai/fix-iemocap
Add SUMO Web Claims Summarization scenario
Test #7473: Pull request #3112 opened by yifanmai
October 30, 2024 18:30 10m 53s yifanmai/fix-sumosum
Fix minor bug in punkt installation logic (#3111)
Test #7472: Commit 79fbde1 pushed by yifanmai
October 29, 2024 03:58 13m 43s main
Add new Ministral and Mistral Small models (#3110)
Test #7471: Commit 0a7273a pushed by yifanmai
October 29, 2024 03:49 13m 45s main
Fix minor bug in punkt installation logic
Test #7470: Pull request #3111 opened by yifanmai
October 29, 2024 03:43 13m 29s yifanmai/fix-punkt-detection
Allow setting device for Hugging Face models (#3109)
Test #7469: Commit 9376c1c pushed by yifanmai
October 29, 2024 03:37 12m 27s main
Add new Ministral and Mistral Small models
Test #7468: Pull request #3110 opened by yifanmai
October 29, 2024 03:34 12m 52s yifanmai/fix-new-mistral
Allow setting device for Hugging Face models
Test #7467: Pull request #3109 synchronize by yifanmai
October 29, 2024 03:02 15m 12s yifanmai/fix-hf-client-device
Allow setting device for Hugging Face models
Test #7466: Pull request #3109 synchronize by yifanmai
October 29, 2024 03:00 13m 16s yifanmai/fix-hf-client-device
Build frontend (#3105)
Test #7464: Commit b33c10c pushed by yifanmai
October 29, 2024 02:38 12m 47s main
Fix typo in downloading_raw_results.md (#3102)
Test #7463: Commit 15cb92f pushed by yifanmai
October 29, 2024 02:30 13m 53s main
Added Mistral 2 Large and Llama 3.1 models on Amazon Bedrock (#3095)
Test #7462: Commit 2b1fad5 pushed by yifanmai
October 28, 2024 21:49 13m 20s main
Changed MMLU Pro for Non-COT Version
Test #7461: Pull request #3108 opened by siyagoel
October 28, 2024 21:41 13m 49s siyagoel/mmluprofinal
Bump werkzeug from 3.0.4 to 3.0.6 in the pip group across 1 directory…
Test #7460: Commit 402dbbd pushed by yifanmai
October 28, 2024 17:14 13m 7s main
CoVost-2: Speech Machine Translation (#3106)
Test #7459: Commit 4b82dfd pushed by teetone
October 28, 2024 04:57 13m 48s main
Fix "science & technology" subset of MMSTAR (#3107)
Test #7458: Commit 49e8a11 pushed by ImKeTT
October 28, 2024 02:13 13m 43s main