API Evangelist Paper · Pricing
The Pricing of AI APIs
How the model labs price inference — and what every API provider can learn about pricing their own.
About this paper
I have been reading API pricing pages for sixteen years, and the conclusion that surprised me most is this: the most transparent class of API provider I found across thousands in the catalog was not the banks or the Fortune 500 — it was the AI labs. The companies selling the most hyped, most expensive APIs of the decade publish their prices more honestly than almost anyone. That inversion is the whole lesson.
This paper is two things at once: a market breakdown with real per-token rates across the flagship labs and the open-model hosts, the spread that shows you exactly where the commodity is, and the discount machinery (caching, batch, committed-use) the headline rate hides — and a pricing playbook for your own API, whether or not it touches AI. Because the lesson under the token economics is the one I have argued for sixteen years: pricing is not a revenue problem you solve once, it is a business-design problem you live with.
What's inside
- The unit — why the token won, and per-token as direct monetization
- The market — flagship rates, the open-model floor, and the spread
- The levers — caching, batch, flex, committed-use: the real price is never the headline
- The on-ramp — free tiers in a category that mostly skipped them
- Where pricing hides — opacity, repricing trauma, and what transparency buys
- Pricing your own API — plans before pricing, cost floor and value ceiling
- Anti-patterns I watch for
- Self-assessment
- Where this is going — machine-readable pricing, FinOps, and pay-per-outcome
What you get for $25.00
These papers are experience-based and vendor-neutral, distilled from the API Evangelist research at apievangelist.com. Questions before buying? [email protected].
arrow_back All papers