Skip to content

Releases: embeddings-benchmark/mteb

1.29.7

16 Jan 10:27
Compare
Choose a tag to compare

1.29.7 (2025-01-16)

Ci

  • ci: only return 1 model_name per file (#1818)

  • only return 1 model_name per file

  • fix args parse

  • revert test change (d7a7791)

Fix

  • fix: add bge-m3 ModelMeta (#1821)

add bge (4ac59bc)

Unknown

  • Add model inf-retriever-v1 (#1744)

  • feat(models): add infly/inf-retriever-v1 model metadata- Add inf_models.py file with metadata for infly/inf-retriever-v1 model

  • Update overview.py to include inf_models in model imports
  • Reformat code

  • Update inf-retriever-v1 ModelMeta

  • Fill more information for inf-retriever-v1

  • Add license information for inf-retriever-v1


Co-authored-by: Samuel Yang <[email protected]> (60c4980)

1.29.6

15 Jan 15:41
Compare
Choose a tag to compare

1.29.6 (2025-01-15)

Fix

  • fix: Added more Chinese models' ModelMeta (#1814)

  • Added Multilingual USE models

  • Added Moka models

  • Added dmeta models

  • Added jina-zh

  • Added piccolo models (748955c)

1.29.5

15 Jan 14:13
Compare
Choose a tag to compare

1.29.5 (2025-01-15)

Fix

  • fix: Add additional contacts (#1817)

add contacts from #1790 (c4ee9fe)

Unknown

1.29.4

15 Jan 09:58
Compare
Choose a tag to compare

1.29.4 (2025-01-15)

Fix

  • fix: Added ModelMeta for BGE, GTE Chinese and multilingual models (#1811)

  • Added BGE Chinese and multilingual-gemma models

  • Added GTE multilingual and Chinese models

  • Fixed date format (3f5ee82)

  • fix: Zero shot and aggregation on Leaderboard (#1810)

  • Made join_revision filter out no_revision_available when other revisions have been run on the task

  • Fixed zero-shot filtering

  • Fixed aggregation of task types

  • Ran linting (0acc166)

1.29.3

14 Jan 13:03
Compare
Choose a tag to compare

1.29.3 (2025-01-14)

Fix

  • fix: allow to load no revision available (#1801)

  • fix allow to load no revision available

  • lint

  • add require_model_meta to leaderboard

  • lint (a202884)

1.29.2

14 Jan 10:49
Compare
Choose a tag to compare

1.29.2 (2025-01-14)

Documentation

  • docs: Add contact to MMTEB benchmarks (#1796)

  • Add myself to MMTEB benchmarks

  • lint (e9e9118)

Fix

  • fix: loading pre 11 (#1798)

  • fix loading pre 11

  • add similarity

  • lint

  • run all task types (94103e6)

1.29.1

13 Jan 21:27
Compare
Choose a tag to compare

1.29.1 (2025-01-13)

Fix

  • fix: Added C-MTEB (#1786)

Added C-MTEB (3ba7e22)

1.29.0

13 Jan 17:51
Compare
Choose a tag to compare

1.29.0 (2025-01-13)

Ci

  • ci: fix model loading test (#1775)

  • pass base branch into the make command as an arg

  • test a file that has custom wrapper

  • what about overview

  • just dont check overview

  • revert instance check

  • explicitly omit overview and init

  • remove test change

  • try on a lot of models

  • revert test model file


Co-authored-by: Isaac Chung <[email protected]> (9b117a8)

Feature

  • feat: Update task filtering, fixing bug which included cross-lingual tasks in overly many benchmarks (#1787)

  • feat: Update task filtering, fixing bug on MTEB

  • Updated task filtering adding exclusive_language_filter and hf_subset
  • fix bug in MTEB where cross-lingual splits were included
  • added missing language filtering to MTEB(europe, beta) and MTEB(indic, beta)

The following code outlines the problems:

import mteb
from mteb.benchmarks import MTEB_ENG_CLASSIC

task = [t for t in MTEB_ENG_CLASSIC.tasks if t.metadata.name == &#34;STS22&#34;][0]
# was eq. to:
task = mteb.get_task(&#34;STS22&#34;, languages=[&#34;eng&#34;])
task.hf_subsets
# correct filtering to English datasets:
# [&#39;en&#39;, &#39;de-en&#39;, &#39;es-en&#39;, &#39;pl-en&#39;, &#39;zh-en&#39;]
# However it should be:
# [&#39;en&#39;]

# with the changes it is:
task = [t for t in MTEB_ENG_CLASSIC.tasks if t.metadata.name == &#34;STS22&#34;][0]
task.hf_subsets
# [&#39;en&#39;]
# eq. to
task = mteb.get_task(&#34;STS22&#34;, hf_subsets=[&#34;en&#34;])
# which you can also obtain using the exclusive_language_filter (though not if there was multiple english splits):
task = mteb.get_task(&#34;STS22&#34;, languages=[&#34;eng&#34;], exclusive_language_filter=True)
  • format

  • remove "en-ext" from AmazonCounterfactualClassification

  • fixed mteb(deu)

  • fix: simplify in a few areas (4a70e5d)

1.28.7

13 Jan 11:01
Compare
Choose a tag to compare

1.28.7 (2025-01-13)

Ci

  • ci: skip AfriSentiLID for now (#1785)

  • skip AfriSentiLID for now

  • skip relevant test case instead


Co-authored-by: Isaac Chung <[email protected]> (71dbd61)

Fix

  • fix: update max tokens for OpenAI (#1772)

update max tokens (0c5c3a5)

1.28.6

11 Jan 17:05
Compare
Choose a tag to compare

1.28.6 (2025-01-11)

Fix

  • fix: added annotations for training data (#1742)

  • fix: Added annotations for arctic embed models

  • added google and bge

  • added cohere

  • Added e5

  • added bge based model2vec

  • annotated oAI

  • format and update annotations (3f093c8)