AI Token Cost Calculator — Compare API Pricing for GPT, Claude & Gemini
AI API costs are charged per token — roughly 4 characters or 0.75 words. Pricing varies by a factor of 1,000× across models: Gemini 1.5 Flash costs $0.075 per million input tokens while Claude Opus 4 costs $15/M. This calculator computes exact per-request, daily, and monthly costs for 13 major models including GPT-4o, Claude, Gemini, Llama, and Mistral.
How to use AI Token Cost Calculator
Select a model
Choose your AI model from the grouped list. Models are organized by provider — OpenAI, Anthropic, Google, Meta, and Mistral.
Enter token counts
Input the average number of input tokens (your prompt) and output tokens (the model's response) per API request.
Set daily request volume
Enter how many API calls you expect per day. The tool calculates per-request, daily, and 30-day monthly costs automatically.
AI Model Pricing Comparison (per 1M tokens, May 2026)
| Model | Provider | Input $/1M | Output $/1M | Best For |
|---|---|---|---|---|
| GPT-5.5 | OpenAI | $5.00 | $30.00 | Current OpenAI flagship |
| GPT-5 | OpenAI | $1.25 | $10.00 | Best value in GPT-5 family |
| GPT-5 mini | OpenAI | $0.25 | $2.00 | Affordable GPT-5 tier |
| GPT-5 nano | OpenAI | $0.05 | $0.40 | Ultra-cheap, high-volume tasks |
| GPT-4o | OpenAI | $2.50 | $10.00 | Multimodal, proven general-purpose |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | Code, long-context reasoning |
| Claude Haiku 4 | Anthropic | $0.80 | $4.00 | Fast, affordable Anthropic tier |
| Gemini 3.5 Flash | $1.50 | $9.00 | Latest Gemini, balanced perf | |
| Gemini 3.1 Lite | $0.25 | $1.50 | Cheapest Gemini 3 tier | |
| DeepSeek V4 Pro | DeepSeek | $1.74 | $3.48 | Strong reasoning, low output cost |
| DeepSeek V4 Flash | DeepSeek | $0.14 | $0.28 | Ultra-cheap, fast inference |
| Llama 3.3 70B | Meta/Groq | $0.59 | $0.79 | Open-weight, self-hostable |
Frequently asked questions
A token is the basic unit of text that AI language models process. In English, one token is approximately 4 characters or 0.75 words. "Hello world" is 2 tokens; a typical paragraph of 100 words is about 130 tokens. Different tokenizers vary slightly — OpenAI uses cl100k_base for GPT-4o and Anthropic uses its own tokenizer. Both providers offer free tokenizer tools to count exact tokens for your specific text before committing to API costs at scale.
You might also need
Further reading
Authority documentation and specifications behind this tool.
Need this built into your product?
We design and build custom software — SaaS platforms, MVPs, AI agents, and web apps.
Have a project in mind?
We turn ideas into production-ready software — SaaS, web apps, mobile, and AI agents. Fixed price. Committed timeline. No surprises.