Massive Text Embedding Benchmark (MTEB)

MTEB is a framework for evaluating text embedding models across a wide range of tasks and datasets. It offers a standardized way to benchmark embedding models, which makes results easier to compare and reproduce. MTEB primarily focuses on large-scale multilingual evaluation, covering multiple languages and use cases. Its task coverage is wide and includes text classification, clustering, semantic search, pair classification (e.g., entailment), reranking, and summarization, so it supports a diverse set of applications where embeddings are critical.
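To make the idea of benchmarking an embedding model concrete, here is a minimal sketch of one task type from the description above, semantic search. It does not use the MTEB API: the `embed` function is a hypothetical bag-of-words stand-in for a real embedding model, and the corpus, queries, and the `retrieval_accuracy` helper are illustrative assumptions, not MTEB datasets or metrics.

```python
import math
from collections import Counter


def embed(text):
    """Hypothetical stand-in for an embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieval_accuracy(embed_fn, queries, corpus, gold):
    """Fraction of queries whose top-ranked document is the gold document."""
    doc_vecs = [embed_fn(doc) for doc in corpus]
    hits = 0
    for query, gold_idx in zip(queries, gold):
        query_vec = embed_fn(query)
        # Rank documents by similarity and keep the index of the best one.
        best = max(range(len(corpus)), key=lambda i: cosine(query_vec, doc_vecs[i]))
        hits += best == gold_idx
    return hits / len(queries)


# Toy data, invented for illustration only.
corpus = [
    "the cat sat on the mat",
    "stock markets fell sharply today",
    "how to bake sourdough bread",
]
queries = ["cat on a mat", "markets fell today", "bake bread"]
gold = [0, 1, 2]

score = retrieval_accuracy(embed, queries, corpus, gold)
print(f"top-1 retrieval accuracy: {score:.2f}")
```

Because the scoring function only needs something callable that maps text to a vector, any model can be swapped in for `embed` and scored on the same data. That is the general idea behind a standardized benchmark, which applies it across many tasks and datasets.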

Web site

GitHub repository

