Data Explorer
Explore the Data
All 447 models with every available benchmark score (third-party evaluations, Epoch data, self-reported scores), plus pricing and performance metrics.
Scores are normalized to 0-100 where applicable. Toggle column groups to show or hide sections. By default, rows are sorted by the IRT (Third-Party) composite score.
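For readers working with the data outside this page, the following is a minimal sketch of what a 0-100 rescaling and a sort on the composite score could look like. It assumes a simple linear (min-max) rescaling, which may not match the method used here. The field name `irt_third_party`, the model names, and the score values are all hypothetical placeholders, not taken from the dataset.

```python
from typing import Optional


def rescale_0_100(raw: float, lo: float, hi: float) -> float:
    """Linearly map raw from [lo, hi] onto [0, 100], clamping out-of-range values.

    Assumption: a plain min-max rescaling; the page does not state its method.
    """
    if hi <= lo:
        raise ValueError("hi must be greater than lo")
    return max(0.0, min(100.0, 100.0 * (raw - lo) / (hi - lo)))


def sort_by_composite(rows: list[dict], key: str = "irt_third_party") -> list[dict]:
    """Return rows sorted by `key`, highest first; rows with no score go last."""
    def sort_key(row: dict) -> tuple[bool, float]:
        score: Optional[float] = row.get(key)
        return (score is None, -(score or 0.0))

    return sorted(rows, key=sort_key)


# Hypothetical rows: names, field name, and values are made up for illustration.
rows = [
    {"model": "model-a", "irt_third_party": 61.5},
    {"model": "model-b", "irt_third_party": None},
    {"model": "model-c", "irt_third_party": 78.0},
]

ranked = sort_by_composite(rows)
print([r["model"] for r in ranked])
print(rescale_0_100(1200, 1000, 1500))
```

Placing rows without a composite score at the end keeps models that have no third-party evaluations visible without letting them interrupt the ranking.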
Download Data
All data is freely available for research use.