Not all benchmarks are created equal. Here is exactly how SEVENAI measures model performance, and why absolute scores matter far less than the direction of travel.

By SEVENAI Editorial · May 17, 2026 · 11 min read · Methodology

Dimension 1 of 5 · Model Benchmarks · 30% (highest weighted)
Performance on MMLU, HumanEval, MATH, and frontier evals. Scored on absolute performance and week-over-week improvement. The single largest component of the SEVENAI Momentum Index.

The most important number in the AI race is not a stock price, a revenue figure, or a headcount. It is a benchmark score: a single percentage point on a standardised evaluation that tells you, with more precision than any earnings call, whether a company's AI models are getting better or falling behind.

Model benchmarks are the scoreboard of the AI race, and they account for 30% of the SEVENAI Momentum Index, the largest single dimension we track. But benchmarks are also the most...
Tracking who leads the AI race among Apple, Microsoft, Google, Amazon, Meta, Tesla and Nvidia