Leaderboard
Models
Every model on every GPU, with throughput in tokens per second, the composite score across all eight use cases, and the resulting cost in dollars per million tokens. Click a column header to sort. How we measure →
The Samples column shows progress-events / score-rows. All data is available as JSON or via the raw downloads on /methodology.
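As a rough illustration of how a dollars-per-million-tokens figure can follow from tokens per second, here is a minimal sketch. It assumes the cost comes from a GPU's hourly rental price divided by hourly token throughput. That formula, the function name, and the `gpu_dollars_per_hour` input are assumptions made for illustration and are not stated on this page; /methodology has the actual definition.

```python
def dollars_per_million_tokens(tokens_per_second: float,
                               gpu_dollars_per_hour: float) -> float:
    """Estimate cost per one million generated tokens.

    Assumption (not confirmed by the leaderboard): cost is the GPU's hourly
    price spread evenly over the tokens it produces in one hour.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if gpu_dollars_per_hour < 0:
        raise ValueError("gpu_dollars_per_hour must not be negative")

    tokens_per_hour = tokens_per_second * 3600      # 3600 seconds in an hour
    dollars_per_token = gpu_dollars_per_hour / tokens_per_hour
    return dollars_per_token * 1_000_000


# Hypothetical numbers, not taken from the leaderboard:
# 50 tokens/s on a GPU rented at $1.80/hour comes out at about $10 per million tokens.
cost = dollars_per_million_tokens(tokens_per_second=50, gpu_dollars_per_hour=1.80)
print(f"${cost:.2f} per million tokens")
```

Under this assumption, doubling throughput at a fixed hourly price halves the cost, which is why a faster but more expensive GPU can still rank as cheaper per token.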