MTEB: Massive Text Embedding Benchmark

1.12.27 (2024-06-13)

Documentation

docs: Add points for #911 (#913)

Add points (5deeb3c)

Fix

fix: Update annotations for English STS tasks (#908)
Update STS16STS.py
Update STS15STS.py
Update STS14STS.py
Update STS13STS.py
add points for 902&908 prs
Update STS16STS.py
Update STS14STS.py
Update STS14STS.py
Update STS16STS.py

Co-authored-by: Tikhonova Maria (f1dd8bb)

Unknown

Update tasks table (844e743)
Update points table (cffd94c)

1.12.26 (2024-06-13)

Documentation

docs: Update annotations for multilingual STS tasks (#902)
Update STS12STS.py
Update STSBenchmarkMultilingualSTS.py
Update STS22CrosslingualSTS.py
Update STS17CrosslingualSTS.py
Update STS22CrosslingualSTS.py
Update STSBenchmarkMultilingualSTS.py
Update STS17CrosslingualSTS.py
Update STS12STS.py
Update STS17CrosslingualSTS.py
Update STSBenchmarkMultilingualSTS.py
Update STS22CrosslingualSTS.py

Co-authored-by: Tikhonova Maria (4383fd3)

Fix

fix: Incorrect handling of qrel_revision fix: #909 (#911)
fix: #909
fix: #909
add: points
clean (63cd4b7)

Unknown

Update tasks table (30433a9)
Update points table (d771437)
(1) Add WebLINX Candidate Elements Reranking Task; (2) move convert_conv_history_to_query from mteb.evaluation.evaluators.RetrievalEvaluator to mteb.evaluation.evaluators.utils (#820)
Initial file (WIP)
Update dates, and indicate todos
Update mteb/tasks/Reranking/eng/WebLINXCandidatesReranking.py
Move convert_conv_history_to_query from RetrievalEvaluator to utils
Add WebLINXCandidatesReranking to mteb.tasks.Reranking
Lint RetrievalEvaluator's imports
lint mteb.evaluation.evaluator.utils
Update dataset path, name and revision, n_samples and avg_character_length
Update revision, add load_data method
Fix typo
Change main score to mrr, since recall@10 is not supported
Add average results
Add points
Run make lint (e34ddaa)
Update tasks table (b16d7d5)
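
The change of the reranking task's main score to MRR (mean reciprocal rank) can be illustrated with a minimal sketch. This is not the code from the PR: the function name `mean_reciprocal_rank` and the input layout (one list of 0/1 relevance labels per query, already in ranked order) are assumptions made for illustration only.

```python
from typing import List, Sequence


def mean_reciprocal_rank(rankings: Sequence[Sequence[int]]) -> float:
    """Average, over queries, of 1 / rank of the first relevant candidate.

    Each inner sequence holds 0/1 relevance labels in ranked order
    (hypothetical input format). A query with no relevant candidate
    contributes 0 to the average.
    """
    if not rankings:
        return 0.0
    total = 0.0
    for labels in rankings:
        for rank, relevant in enumerate(labels, start=1):
            if relevant:
                total += 1.0 / rank
                break  # only the first relevant hit counts
    return total / len(rankings)


# Three queries: first hit at rank 2, at rank 1, and no hit at all.
example: List[List[int]] = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]
score = mean_reciprocal_rank(example)
```

Unlike recall@k, MRR only looks at the position of the first relevant candidate, so it stays well defined for any candidate list length.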

1.12.25 (2024-06-11)

Documentation

docs: Add points for paper writing (#901)
add points for paper writing
add PR number
correct typo (fbbc44b)

Fix

fix: Backfilled bibtex citations data (#900)
Added IndicNLP News Classification
Added IndicNLP News Classification
Added results
Updated dataset version
Small fixes
Small fix
Small fix
Updated results
Fix linting issues
Added points
Resolve conflict
Update 610.jsonl
Backfilled missing bibtex citations.
Backfilled missing bibtex citations.
Remove non-present files

Co-authored-by: Imene Kerboua, Kenneth Enevoldsen (77d0e06)

Unknown

Update points table (a7ce58f)
Update tasks table (97eb8b3)

1.12.24 (2024-06-09)

Fix

fix: Add openai and voyage models (#887)
Add openai and voyage models
Add rate limit for voyage model
Add rate and token limit for voyage (ad9b3ce)
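
A combined request and token limit for an embedding API can be sketched as a sliding-window limiter. This is a generic illustration, not the implementation added in #887: the class name `RateTokenLimiter`, its parameters, and the window length are all hypothetical, and the real limits depend on the provider.

```python
import time
from collections import deque
from typing import Callable, Deque, Tuple


class RateTokenLimiter:
    """Block until a call fits within both a request and a token budget.

    Hypothetical helper: budgets apply to a sliding window of `period`
    seconds. `clock` and `sleep` are injectable so the class can be
    tested without real waiting.
    """

    def __init__(
        self,
        max_requests: int,
        max_tokens: int,
        period: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        if max_requests < 1 or max_tokens < 1:
            raise ValueError("budgets must be positive")
        self.max_requests = max_requests
        self.max_tokens = max_tokens
        self.period = period
        self._clock = clock
        self._sleep = sleep
        self._events: Deque[Tuple[float, int]] = deque()  # (timestamp, tokens)

    def acquire(self, tokens: int) -> None:
        """Reserve one request that will consume `tokens` tokens."""
        if tokens > self.max_tokens:
            raise ValueError("single request exceeds the token budget")
        while True:
            now = self._clock()
            # Drop events that have left the sliding window.
            while self._events and now - self._events[0][0] >= self.period:
                self._events.popleft()
            used = sum(t for _, t in self._events)
            fits_requests = len(self._events) < self.max_requests
            fits_tokens = used + tokens <= self.max_tokens
            if fits_requests and fits_tokens:
                self._events.append((now, tokens))
                return
            # Wait until the oldest event expires, then re-check.
            self._sleep(self.period - (now - self._events[0][0]))
```

A caller would invoke `limiter.acquire(n_tokens)` immediately before each API request, with `n_tokens` estimated from the batch being sent.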

Unknown

Update points table (052c9dd)

1.12.23 (2024-06-08)

Documentation

docs: Update points.md (#890) (4318c82)

Fix

fix: abstention metric for small datasets (#893)
fix abstention bug
make lint (9d28296)

Unknown

Update tasks table (bab7503)
Update points table (c61ab09)
Added multilabel stratification to AbsTaskMultilabelClassification (#760)

merge conflicts fixed for stratification (d7dc9a8)

1.12.22 (2024-06-06)

Fix

fix: Add GritLM (#880)
Add GritLM
Remove unused imports
Use embedding mode to save memory
Format
Format
Add langs

Co-authored-by: Kenneth Enevoldsen

Add langs

Co-authored-by: Kenneth Enevoldsen

Change loader

Co-authored-by: Kenneth Enevoldsen (0c99a4e)

1.12.21 (2024-06-05)

Documentation

docs: Added source for CmedqaRetrieval (#886)

docs: update CmedqaRetrieval description to specify source (3e910ff)

Fix

fix: Add error reporting for Retrieval (#873)
add error reporting.
no message
Update mteb/abstasks/AbsTaskRetrieval.py Co-authored-by: Isaac Chung
add kwarg args
change to metadata.name
fix format
Update mteb/abstasks/AbsTaskRetrieval.py Co-authored-by: Isaac Chung
change to explicit args
remove cmdline changes

Co-authored-by: Isaac Chung (5397bd2)

1.12.20 (2024-06-05)

Fix

fix: Updated CLI for MTEB (#882)
Updated CLI for MTEB

It now includes three main commands: one for running, one for getting an overview, and one for creating the metadata for hf.

I also:

added a lower bound on dependencies, as it caused a few issues.
deleted some results with a model revision attached (also created a fix to make sure that doesn't happen as much going forward)
added a test for the CLI
added a quite extensive docstring to the CLI
made relevant changes to the documentation

ensure tests pass
minor changes to PR template
Minor changes to PR template
remove tmp file
fixed failing test and updated it to avoid future false positives
fix deprecation warning for logger.warn
don't clear path before running tests as it disturbs other tests when run in parallel (c6f618b)
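
The three-command layout described in this release (run, overview, metadata creation) can be sketched with `argparse` subcommands. This is a rough illustration, not the actual CLI: the program name, the subcommand names `run`, `available_tasks`, and `create_meta`, and every flag below are assumptions, so the CLI docstring mentioned above remains the reference for the real interface.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with three subcommands (all names are hypothetical)."""
    parser = argparse.ArgumentParser(
        prog="benchmark-cli",
        description="Sketch of a three-command benchmark CLI.",
    )
    subparsers = parser.add_subparsers(dest="command", required=True)

    # 1) Run a model on one or more tasks.
    run = subparsers.add_parser("run", help="Run a model on a set of tasks")
    run.add_argument("-m", "--model", required=True, help="Model name")
    run.add_argument("-t", "--tasks", nargs="+", default=None, help="Task names")
    run.add_argument("--output_folder", default="results")

    # 2) Print an overview of what can be run.
    subparsers.add_parser("available_tasks", help="List the available tasks")

    # 3) Create model-card metadata from stored results.
    meta = subparsers.add_parser("create_meta", help="Create metadata from results")
    meta.add_argument("--results_folder", required=True)
    meta.add_argument("--output_path", default="model_card.md")

    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["run", "-m", "my-model", "-t", "TaskA", "TaskB"])
```

Keeping each command in its own subparser lets every command carry its own required arguments and help text.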

1.12.19 (2024-06-05)

Documentation

docs: minor fix for point validation to avoid error when people split up points (7d0d631)

Fix

fix: Add CEDR, SensitiveTopics for multilabel and RuBQ for reranking (#881)
add russian reranking and multilabel tasks
fix import order
add results for baselines
add points
add points for review (9128df4)

Unknown

Update tasks table (7d3ce53)
Update points table (ef52f95)
Merge branch 'main' of https://github.com/embeddings-benchmark/mteb (7c7ee2b)
Updated CLI for MTEB

It now includes three main commands: one for running, one for getting an overview, and one for creating the metadata for hf.

I also:

added a lower bound on dependencies, as it caused a few issues.
deleted some results with a model revision attached (also created a fix to make sure that doesn't happen as much going forward)
added a test for the CLI
added a quite extensive docstring to the CLI
made relevant changes to the documentation (29b1c34)

1.12.18 (2024-06-05)

Fix

fix: Ensure results are consistently stored in the same way (#876)
Ensure results are consistently stored in the same way

(due to failing test): updated missing dataset references
(to test with more than one model) Added e5 models base and large
updated mteb.get_model to now include metadata in the model object
ensure that model name is always included when saving (with a default when it is not available)
use the ModelMeta for the model_meta.json

format
minor test fixes
docs: Minor updates to repro. workflow docs
fixed failing test
format
Apply suggestions from code review

Co-authored-by: Isaac Chung

docs: update PR template
fix: Added benchmark object (#878)
removed duplicate task
Added benchmark object
removed import for duplicate task
fix dataset references
added seb
Added test for running benchmarks
changed tasks to be an iterable
format
Apply suggestions from code review

Co-authored-by: Niklas Muennighoff, Isaac Chung

Co-authored-by: Isaac Chung, Niklas Muennighoff (fb843d0)
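
The idea of always writing a model name when saving, with a default when none is available, can be sketched as follows. This is an assumption-laden illustration rather than the library's code: `ModelMetaSketch`, `save_model_meta`, the placeholder strings, and the folder layout are all hypothetical; only the file name `model_meta.json` comes from the notes above.

```python
import json
from dataclasses import asdict, dataclass
from pathlib import Path
from typing import Optional

# Hypothetical placeholders used when the model does not report these values.
DEFAULT_MODEL_NAME = "no_model_name_available"
DEFAULT_REVISION = "no_revision_available"


@dataclass
class ModelMetaSketch:
    """Simplified stand-in for a model metadata object."""

    name: Optional[str] = None
    revision: Optional[str] = None


def save_model_meta(meta: ModelMetaSketch, output_folder: Path) -> Path:
    """Write model_meta.json under <output>/<name>/<revision>/ (assumed layout)."""
    name = meta.name or DEFAULT_MODEL_NAME
    revision = meta.revision or DEFAULT_REVISION
    # Replace "/" so names like "org/model" map to a single folder.
    folder = output_folder / name.replace("/", "__") / revision
    folder.mkdir(parents=True, exist_ok=True)

    payload = asdict(meta)
    payload["name"] = name  # always present, even when it had to be defaulted
    payload["revision"] = revision
    path = folder / "model_meta.json"
    path.write_text(json.dumps(payload, indent=2))
    return path
```

Resolving the defaults in one place means every result folder has the same shape, whether or not the model exposes its own metadata.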

Unknown

Update tasks table (dfbdfdc)