LLM Leaderboards

By Xavier Collantes

It is hard to keep up with LLMs because major advancements happen weekly. These leaderboards help me track the latest models and their performance against each other.

LLMs advance so quickly that some of these leaderboards may stop working or move to another location.

HuggingFace Performance Leaderboard

Compares pricing, context window, and response time. This is the one I have seen referenced most often.

huggingface.co/LLM-Performance-Leaderboard

MTEB Leaderboard

This leaderboard compares 100+ text and image embedding models across 1000+ languages. Great for finding an embedding model that supports a particular human language.

huggingface.co/spaces/mteb/leaderboard

LLM Arena

Great for ranking LLMs by use case: Text, Image, Vision, Search, Text-to-Image.

lmarena.ai/leaderboard

LLM Stats

Nice visual graphs of LLM benchmarks. It also lists each model's training cutoff date.

llm-stats.com

Vellum

Latest public benchmark performance for SOTA model versions released after April 2024. The data comes from model providers as well as from evaluations run independently by Vellum or the open-source community.

vellum.ai/llm-leaderboard