Commit

Release MMLU v1.10.0, Lite v1.10.0, AIR-Bench v1.2.0 (#3140)
yifanmai authored Nov 7, 2024
1 parent a40c760 commit a05c4b0
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions helm-frontend/project_metadata.json
@@ -3,7 +3,7 @@
 "title": "Lite",
 "description": "Lightweight, broad evaluation of the capabilities of language models using in-context learning",
 "id": "lite",
-"releases": ["v1.9.0", "v1.8.0", "v1.7.0", "v1.6.0", "v1.5.0", "v1.4.0", "v1.3.0", "v1.2.0", "v1.1.0", "v1.0.0"]
+"releases": ["v1.10.0", "v1.9.0", "v1.8.0", "v1.7.0", "v1.6.0", "v1.5.0", "v1.4.0", "v1.3.0", "v1.2.0", "v1.1.0", "v1.0.0"]
 },
 {
 "title": "Classic",
@@ -27,7 +27,7 @@
 "title": "MMLU",
 "description": "Massive Multitask Language Understanding (MMLU) evaluations using standardized prompts",
 "id": "mmlu",
-"releases": ["v1.9.0", "v1.8.0", "v1.7.0", "v1.6.0", "v1.5.0", "v1.4.0", "v1.3.0", "v1.2.0", "v1.1.0", "v1.0.0"]
+"releases": ["v1.10.0", "v1.9.0", "v1.8.0", "v1.7.0", "v1.6.0", "v1.5.0", "v1.4.0", "v1.3.0", "v1.2.0", "v1.1.0", "v1.0.0"]
 },
 {
 "title": "VHELM",
@@ -45,7 +45,7 @@
 "title": "AIR-Bench",
 "description": "Safety benchmark based on emerging government regulations and company policies",
 "id": "air-bench",
-"releases": ["v1.1.0", "v1.0.0"]
+"releases": ["v1.2.0", "v1.1.0", "v1.0.0"]
 },
 {
 "title": "CLEVA",

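The change above prepends one new version string to the `releases` list of three projects, keeping the newest release first. A minimal Python sketch of that kind of edit follows. It assumes the metadata is a JSON array of project objects with `id` and `releases` keys, as the hunks suggest; the helper name `add_release` and the trimmed-down sample data are hypothetical and not part of the repository.

```python
import json


def add_release(projects, project_id, version):
    """Prepend `version` to the releases of the project whose id matches.

    Hypothetical helper: assumes `projects` is a list of dicts shaped like
    the entries shown in the diff ("title", "id", "releases", ...).
    """
    for project in projects:
        if project.get("id") == project_id:
            releases = project.setdefault("releases", [])
            if version not in releases:
                # Newest release goes first, matching the ordering in the diff.
                releases.insert(0, version)
            return project
    raise KeyError(f"no project with id {project_id!r}")


# Trimmed-down stand-in for helm-frontend/project_metadata.json (assumed shape).
projects = json.loads("""
[
  {"title": "Lite", "id": "lite", "releases": ["v1.9.0", "v1.8.0"]},
  {"title": "AIR-Bench", "id": "air-bench", "releases": ["v1.1.0", "v1.0.0"]}
]
""")

add_release(projects, "lite", "v1.10.0")
add_release(projects, "air-bench", "v1.2.0")

print(json.dumps(projects, indent=2))
```

Calling `add_release` a second time with the same version leaves the list unchanged, so the sketch is safe to re-run.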