telexed ~ c / bc689d3c-199radar:50 · infra_saasLIVE
← back
NO.
#bc689d3c
Topic
INFRA & SAAS
Source
vercel_blog
Published
2026-05-12 04:00:00
Importance
★ 5/10 — radar 50
`AI Gateway` production index: spend goes to `Anthropic`, volume to `Google`
FIG-0061:1

`AI Gateway` production index: spend goes to `Anthropic`, volume to `Google`

Production traffic splits cleanly by risk tolerance, not by one winner. Route premium reasoning to Claude and bulk cheap calls to Gemini Flash; single-provider bets now look expensive.

[ KEY POINTS ]
  1. April 2026 spend share was 61% `Anthropic`, 21% `Google`, 12% `OpenAI`. Money is flowing to high-stakes calls, not raw traffic.
  2. Token volume flipped the ranking: 38% `Google`, 26% `Anthropic`, 13% `OpenAI`, 10% `xAI`. Cheap models are absorbing mass-market workloads.
  3. Within the same customer base, premium reasoning lands on Claude Opus while low-cost throughput lands on Gemini Flash. Model routing is becoming the default stack design.
  4. B2B tokens cost roughly 2x B2C per token. If mistakes create legal or operational risk, the savings from cheaper models disappear fast.
  5. Anthropic held 71% token share in back-office workloads but only 7% in consumer. Google concentrated in consumer with Gemini Flash at 28% of tokens and 15% of cost.
Originalvercel.com/blog/ai-gateway-production-indexRead original →

// related