It is hard to keep up with LLMs because major advancements happen weekly. These
leaderboards help me track the latest models and their performance against each
other.
LLM advancements are so fast, these leaderboards may not work or are
migrated to another form.
HuggingFace Performance Leaderboard
Compares pricing, context window, pricing, and response time. This one I have
seen the most commonly referred to.
Latest public benchmark performance for SOTA model versions released after April 2024. The data comes from model providers as well as independently run
evaluations by Vellum or the open-source community.