The AI inference demand index · June 24, 2026

AI tokens processed today

Tokens are the “barrels per day” of artificial intelligence. There is no official meter for global token consumption, so this is a transparent, fully sourced estimate, modeled live from the latest disclosures.

~420 trillion
Estimated AI tokens the world processes today — best global estimate, modeled from disclosures and growing ~2.2×/year
Disclosed floor today: 300.1T
Independent estimates: 400–432T
Tokens per day over time
[Chart: global floor in trillions of tokens per day, linear scale from 0 to 500, February 2024 to June 2026. Dots are actual disclosures; lines interpolate between them. The independent-estimate range of 400–432 is marked for comparison.]
Why the curve is this steep, and why 2024 looks near-zero. Two effects compound. The growth is real: China’s National Data Bureau reports a rise of more than 1,000× in two years (0.1→140T/day), and Google reports ~330× (9.7→3,200T/month). But the early floor is also tiny because almost no one disclosed token counts before 2025, so the 2024 values are a sparse lower bound covering only those who had reported, not a measure of true usage. The floor widens as more sources report.
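The growth multiples quoted above can be checked directly from their endpoints. The sketch below is only arithmetic on the figures in the paragraph; the helper names are illustrative, and the 30-day month used to put Google's monthly figure on a daily basis is an assumption, not something the disclosures state. On these endpoints, China's rise works out to roughly 1,400×, and Google's to roughly 330×.

```python
def fold_growth(start: float, end: float) -> float:
    """Return how many times larger `end` is than `start`."""
    return end / start


def annualized_factor(fold: float, years: float) -> float:
    """Constant yearly multiplier that compounds to `fold` over `years`."""
    return fold ** (1.0 / years)


# Endpoints quoted in the text.
china_fold = fold_growth(0.1, 140.0)     # trillions of tokens per day
google_fold = fold_growth(9.7, 3200.0)   # trillions of tokens per month

# China's rise is stated as spanning two years, so it can be annualized.
china_per_year = annualized_factor(china_fold, 2.0)

# Assumption: a 30-day month, to express Google's latest figure per day.
google_per_day = 3200.0 / 30.0

print(f"China: {china_fold:,.0f}x over two years, ~{china_per_year:.1f}x per year")
print(f"Google: {google_fold:,.0f}x, ~{google_per_day:.0f}T/day at the latest point")
```

Google's multiple is not annualized here because the paragraph does not state the period it covers.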

How our floor compares to independent estimates

Our reported floor (300.1T/day) counts only disclosures, so it sits below what independent analysts estimate for the true global total (~400–432T/day). Across all scopes, the independent estimates range from the paid API market (~50T/day) up to all-surface throughput (~432T/day).

| Source | Scope | Trillions of tokens/day | As of | Confidence |
| --- | --- | --- | --- | --- |
| Tokens Per Day (this index) | Reported floor (disclosures only) | 300.1 | 2026-06-24 | measured |
| Epoch AI / Exponential View | All providers (global total) | ~432 | mid-2026 | medium |
| a16z / OpenRouter | LLM API market | ~50 | late-2025 | medium |
| Epoch AI | Frontier models | 10–100 | late-2025 | medium |
| OpenRouter (1% extrapolation) | Global inference | ~400 | late-2025 | low |

These are independent third-party estimates, included for comparison and not summed into our figures. Definitions differ: “all providers” includes multimodal/all-surface tokens, while “API market” excludes captive first-party traffic.
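As a rough illustration of the comparison, the sketch below computes how much of each global-scope estimate the reported floor already covers. It uses only the figures from the table; treating the two "global" rows as directly comparable to the floor is a simplifying assumption, since the definitions differ.

```python
# Values from the comparison table, in trillions of tokens per day.
reported_floor = 300.1
global_estimates = {
    "Epoch AI / Exponential View": 432.0,
    "OpenRouter (1% extrapolation)": 400.0,
}

# Share of each estimate covered by the floor, and the undisclosed remainder.
coverage = {name: reported_floor / est for name, est in global_estimates.items()}
gaps = {name: est - reported_floor for name, est in global_estimates.items()}

for name in global_estimates:
    print(f"{name}: floor covers {coverage[name]:.0%}, gap {gaps[name]:.1f}T/day")
```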


How the number is built

Three independent layers, each more uncertain than the last. They constrain each other — agreement across all three is what makes the headline credible.

Layer 1 · Floor

Reported floor

Only public disclosures, normalized to trillions/day at each disclosure’s period midpoint. Overlaps are removed so nothing is double-counted. A hard lower bound.
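One way the normalization described above might look in code is sketched here. Everything in it is hypothetical: the `Disclosure` type, the `parent` field used to mark a figure already contained in another source's total, the source names, and the numbers are illustrative placeholders, not the index's actual schema or data.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Disclosure:
    source: str
    tokens_trillions: float        # total tokens reported for the period
    period_start: date             # first day covered (inclusive)
    period_end: date               # last day covered (inclusive)
    parent: Optional[str] = None   # hypothetical: source whose total already includes this one


def per_day(d: Disclosure) -> float:
    """Normalize a period total to trillions of tokens per day."""
    days = (d.period_end - d.period_start).days + 1
    return d.tokens_trillions / days


def midpoint(d: Disclosure) -> date:
    """Date at which the normalized value is plotted."""
    return d.period_start + (d.period_end - d.period_start) // 2


def reported_floor(disclosures: list[Disclosure]) -> float:
    """Sum per-day values, skipping any disclosure whose parent is also present."""
    present = {d.source for d in disclosures}
    return sum(per_day(d) for d in disclosures if d.parent not in present)


# Placeholder data, for illustration only.
sample = [
    Disclosure("Provider A", 3000.0, date(2026, 6, 1), date(2026, 6, 30)),
    Disclosure("Provider B", 50.0, date(2026, 6, 15), date(2026, 6, 15)),
    Disclosure("Product A1", 600.0, date(2026, 6, 1), date(2026, 6, 30), parent="Provider A"),
]
floor_value = reported_floor(sample)
```

In this sketch "Product A1" is excluded because its tokens are assumed to be inside "Provider A"'s total, which is one simple reading of "overlaps are removed".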

Layer 2 · Estimate

Implied estimates

For big names that don’t disclose usable counts, we estimate by the method that fits — revenue-implied for usage-billed providers, usage-implied for free consumer products.
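The two estimation routes can be sketched as simple unit conversions. This is a minimal illustration under stated assumptions, not the index's actual model: the function names, the idea of a single blended price per million tokens, and every input value below are hypothetical placeholders.

```python
def revenue_implied_tokens_per_day(annual_revenue_usd: float,
                                   blended_price_usd_per_million: float) -> float:
    """Usage-billed provider: revenue divided by price gives tokens sold.

    Returns trillions of tokens per day, assuming a 365-day year.
    """
    tokens_per_year = annual_revenue_usd / blended_price_usd_per_million * 1_000_000
    return tokens_per_year / 365 / 1e12


def usage_implied_tokens_per_day(daily_active_users: float,
                                 messages_per_user_per_day: float,
                                 tokens_per_message: float) -> float:
    """Free consumer product: users x activity x tokens per interaction.

    Returns trillions of tokens per day.
    """
    return daily_active_users * messages_per_user_per_day * tokens_per_message / 1e12


# Placeholder inputs, chosen only to make the arithmetic easy to follow.
api_estimate = revenue_implied_tokens_per_day(3.65e9, 2.0)
consumer_estimate = usage_implied_tokens_per_day(100e6, 10, 1000)
```

With these placeholder inputs the revenue route gives 5T/day and the usage route gives 1T/day; real inputs would each carry their own uncertainty range.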

Layer 3 · Check

Compute sanity check

An independent cross-check from the hardware side: accelerators × throughput × utilization. Demand sits at ~48% of mid-case capacity — physically comfortable.
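The hardware-side check multiplies three quantities and compares the result with demand. The sketch below shows the shape of that calculation; the accelerator count, per-accelerator throughput, and utilization are placeholder values picked for illustration and are not the index's mid-case inputs. The only figures taken from the page are the ~420T/day headline and the ~48% ratio, which together imply a mid-case capacity of about 875T/day.

```python
SECONDS_PER_DAY = 86_400


def capacity_trillions_per_day(accelerators: float,
                               tokens_per_second_each: float,
                               utilization: float) -> float:
    """Accelerators x throughput x utilization, in trillions of tokens per day."""
    return accelerators * tokens_per_second_each * utilization * SECONDS_PER_DAY / 1e12


def demand_share(demand: float, capacity: float) -> float:
    """Fraction of capacity that estimated demand would occupy."""
    return demand / capacity


# Capacity implied by the page's own figures: ~420T/day at ~48% of capacity.
implied_capacity = 420.0 / 0.48

# Placeholder hardware inputs (hypothetical, for illustration only).
capacity = capacity_trillions_per_day(
    accelerators=5_000_000,
    tokens_per_second_each=5_000,
    utilization=0.40,
)
share = demand_share(420.0, capacity)
print(f"capacity ~{capacity:.0f}T/day, demand share ~{share:.0%}")
```

A share well below 100% means the demand estimate does not require more hardware than plausibly exists; a share above 100% would signal that one of the layers is overstated.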


What this is — and what it isn’t

It is a transparent floor-and-estimate for global AI token throughput, where every disclosure links to its source and an archived snapshot.

It isn’t an official total. Real usage is higher than the reported floor; the estimates are explicitly uncertain. We publish the lower bound and show our work rather than a single confident number.



Sources & references

Every figure on this index is traceable to a primary source. The full citation list, with archived snapshots, lives on the Sources page.


Sources do not all measure the same thing (input vs output tokens, one provider vs all surfaces, marketplace samples vs global totals, multimodal vs text). We preserve what each reported and flag the differences — see the methodology.