MTEB: Massive Text Embedding Benchmark
Add poinhts (5deeb3c
)
fix: Update annotations for English STS tasks (#908)
Update STS16STS.py
Update STS15STS.py
Update STS14STS.py
Update STS13STS.py
add points for 902&908 prs
Update STS16STS.py
Update STS14STS.py
Update STS14STS.py
Update STS16STS.py
Co-authored-by: Tikhonova Maria <[email protected]> (f1dd8bb
)
docs: Update annotations for multilingual STS tasks (#902)
Update STS12STS.py
Update STSBenchmarkMultilingualSTS.py
Update STS22CrosslingualSTS.py
Update STS17CrosslingualSTS.py
Update STS22CrosslingualSTS.py
Update STSBenchmarkMultilingualSTS.py
Update STS17CrosslingualSTS.py
Update STS12STS.py
Update STS17CrosslingualSTS.py
Update STSBenchmarkMultilingualSTS.py
Update STS22CrosslingualSTS.py
Co-authored-by: Tikhonova Maria <[email protected]> (4383fd3
)
fix: Incorrect handling of qrel_revision fix: #909 (#911)
fix: #909
fix: #909
add: points
clean (63cd4b7
)
Update tasks table (30433a9
)
Update points table (d771437
)
(1) Add WebLINX Candidate Elements Reranking Task; (2) move convert_conv_history_to_query
from mteb.evaluation.evaluators.RetrievalEvaluator
to mteb.evaluation.evaluators.utils
(#820)
Initial file (WIP)
Update dates, and indicate dotos
Update mteb/tasks/Reranking/eng/WebLINXCandidatesReranking.py
Move convert_conv_history_to_query
from RetrievalEvaluator to utils
Add WebLINXCandidatesReranking to mteb.tasks.Reranking
Lint RetrievalEvaluator's imports
lint mteb.evaluation.evaluator.utils
Update dataset path, name and revision, n_samples and avg_character_lenght
Update revision, add load_data method
Fix typo
Change main score to mrr, since recall@10 is not supported
Add average results
Add points
Run make lint
(e34ddaa
)
Update tasks table (b16d7d5
)
docs: Add points for paper writing (#901)
add points for paper writing
add PR number
correct typo (fbbc44b
)
fix: Backfilled bibtex citations data (#900)
Added IndicNLP News Classificaiton
Added IndicNLP News Classificaiton
Added results
Updated dataset version
Small fixes
Small fix
Small fix
Updated results
Fix linting issues
Added points
Resolve conflict
Update 610.jsonl
Backfilled missing bibtex citations.
Backfilled missing bibtex citations.
Remove non-present files
Co-authored-by: Imene Kerboua <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]> (77d0e06
)
4318c82
)fix: abstention metric for small datasets (#893)
fix abstention bug
make lint (9d28296
)
Update tasks table (bab7503
)
Update points table (c61ab09
)
Added multilabel stratification to AbsTaskMultilabelClassification (#760)
merge conflicts fixed for stratification (d7dc9a8
)
fix: Add GritLM (#880)
Add GritLM
Remove unused imports
Use embedding mode to save memory
Format
Format
Add langs
Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]>
Co-authored-by: Kenneth Enevoldsen <[email protected]> (0c99a4e
)
docs: update CmedqaRetrieval description to specify source (3e910ff
)
fix: Add error reporting for Retrieval (#873)
add error reporting.
no message
Update mteb/abstasks/AbsTaskRetrieval.py Co-authored-by: Isaac Chung <[email protected]>
add kwarg args
change to metadata.name
fix format
Update mteb/abstasks/AbsTaskRetrieval.py Co-authored-by: Isaac Chung <[email protected]>
change to explicit args
remove cmdline changes
Co-authored-by: Isaac Chung <[email protected]> (5397bd2
)
fix: Updated CLI for MTEB (#882)
Updated CLI for MTEB
It now includes three main commands one for running, one for getting an overview and one for creating the metadata for hf.
I also
ensure tests pass
minor changes to PR template
Minor changes to PR template
remove tmp file
fixed failing test and updated it to avoid future false positives
fix deprecation warning for logger.warn
don't clear path before running tests as it disturbs other tests when run in parallel (c6f618b
)
7d0d631
)fix: Add CEDR, SensitiveTopics for multilabel and RuBQ for reranking (#881)
add russian reranking and multilabel tasks
fix import order
add results for baselines
add points
add points for review (9128df4
)
Update tasks table (7d3ce53
)
Update points table (ef52f95
)
Merge branch 'main' of https://github.com/embeddings-benchmark/mteb (7c7ee2b
)
Updated CLI for MTEB
It now includes three main commands one for running, one for getting an overview and one for creating the metadata for hf.
I also
29b1c34
)fix: Ensure result are consistently stored in the same way (#876)
Ensure result are consistently stored in the same way
format
minor test fixes
docs: Minor updated to repro. workflow docs
fixed failing test
format
Apply suggestions from code review
Co-authored-by: Isaac Chung <[email protected]>
docs: update PR template
fix: Added benchmark object (#878)
removed duplicate task
Added benchmark object
removed import for duplicate task
fix dataset references
added seb
Added test for running benchmarks
changed tasks to be an iterable
format
Apply suggestions from code review
Co-authored-by: Niklas Muennighoff <[email protected]> Co-authored-by: Isaac Chung <[email protected]>
Co-authored-by: Isaac Chung <[email protected]>
Co-authored-by: Niklas Muennighoff <[email protected]> (fb843d0
)
dfbdfdc
)