LLM API Pricing Comparison (2026)
The per-million-token API price of every major large language model in one place: Claude, GPT-5, Gemini, Grok and DeepSeek. Type your monthly token volume below and the table ranks every model by your actual estimated bill. Nothing is uploaded or saved.
Estimate your monthly API bill
A token is roughly 4 characters (about 750 words per 1,000 tokens). The "Your cost" column and ranking update as you type. Estimate only; caching and batch discounts can lower it.
| Provider | Model | Input $/1M |
Output $/1M |
Context | Your cost ▼ /mo |
|---|---|---|---|---|---|
| Anthropic | Claude Haiku 4.5Fastest, cheapest Claude. | $1.00 | $5.00 | 200K | — |
| Anthropic | Claude Sonnet 4.6Best speed/intelligence balance. | $3.00 | $15.00 | 1M | — |
| Anthropic | Claude Opus 4.8Flagship Opus; 1M context at standard price. | $5.00 | $25.00 | 1M | — |
| Anthropic | Claude Fable 5Most capable Claude; thinking always on. | $10.00 | $50.00 | 1M | — |
| OpenAI | GPT-5 miniLow-cost workhorse. | $0.25 | $2.00 | 400K | — |
| OpenAI | GPT-5.4 miniCheaper mini tier. | $0.75 | $4.50 | 400K | — |
| OpenAI | GPT-5Original GPT-5. | $1.25 | $10.00 | 400K | — |
| OpenAI | GPT-5.4Previous flagship. | $2.50 | $15.00 | 400K | — |
| OpenAI | GPT-5.5Current flagship; cached input ~90% off. | $5.00 | $30.00 | 400K | — |
| Gemini 3.1 Flash-LiteCheapest Gemini. | $0.25 | $1.50 | 1M | — | |
| Gemini 3 Flash (Preview)Preview tier. | $0.50 | $3.00 | 1M | — | |
| Gemini 3.5 FlashCached input ~$0.15/M. | $1.50 | $9.00 | 1M | — | |
| Gemini 3.1 ProTiered: 2x in / 1.5x out above 200K tokens. | $2.00 | $12.00 | 1M | — | |
| xAI | Grok 4.1 FastBudget tier; cached input ~$0.20/M. | $0.20 | $0.50 | 1M | — |
| xAI | Grok 4.3Very low output price. | $1.25 | $2.50 | 1M | — |
| DeepSeek | DeepSeek V4 FlashCheapest here; cache hits ~98% off. | $0.14 | $0.28 | 128K | — |
| DeepSeek | DeepSeek V4 ProOpen-weight; aggressive caching. | $0.43 | $0.87 | 128K | — |
Standard list prices per 1,000,000 tokens, verified June 2026. Some models have tiered pricing (e.g. Gemini Pro charges more above 200K tokens) and most offer batch (~50% off) and cached-input (often 90%+ off) discounts not reflected here. Always confirm current pricing on the provider's own page.