Free AI Token Counter & Cost Calculator
Count tokens and calculate API costs for GPT-4, Claude, Gemini, and more. Real-time pricing for all major LLM providers.
How to Use This Token Counter
- Paste your text — Enter your prompt, system message, or any text you want to analyze in the input box above.
- View token count — The tool instantly shows how many tokens your text contains using OpenAI's tokenization algorithm.
- Select your model — Choose the AI model you plan to use from the dropdown to see accurate pricing.
- Set expected output — Enter how many tokens you expect the AI to generate in response for complete cost estimation.
- Compare models — Scroll down to see cost comparison across all major providers to find the best value.
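The steps above come down to simple arithmetic: multiply each token count by its per-1K price and add the two results. Below is a minimal sketch, assuming prices copied from the reference tables on this page. The `PRICES_PER_1K` dictionary and both function names are illustrative and are not part of the tool itself.

```python
# Illustrative prices in USD per 1,000 tokens (input, output),
# copied from the reference tables on this page. Verify before relying on them.
PRICES_PER_1K = {
    "GPT-4o": (0.0025, 0.01),
    "GPT-4o Mini": (0.00015, 0.0006),
    "Claude 3.5 Sonnet": (0.003, 0.015),
    "DeepSeek V3": (0.00014, 0.00028),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for a single request."""
    input_price, output_price = PRICES_PER_1K[model]
    return (input_tokens / 1000) * input_price + (output_tokens / 1000) * output_price


def compare_models(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [(m, estimate_cost(m, input_tokens, output_tokens)) for m in PRICES_PER_1K]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Hypothetical request: a 2,000-token prompt with a 500-token reply.
    for model, cost in compare_models(input_tokens=2000, output_tokens=500):
        print(f"{model}: ${cost:.6f}")
```

For a 2,000-token prompt and a 500-token reply, GPT-4o comes to 2 × \$0.0025 + 0.5 × \$0.01 = \$0.01.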
Understanding AI Tokens
What is a Token?
A token is the basic unit of text that AI models process. Think of it as a piece of a word. In English, one token is roughly:
- ~4 characters on average
- ~0.75 words (or 1 word ≈ 1.33 tokens)
- For code: variable names, operators, and syntax are separate tokens
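These rules of thumb can be turned into a quick estimator when no tokenizer is available. The sketch below is a heuristic for English text only. The function name `estimate_tokens` and the choice to average the two rules are assumptions made for illustration. For exact counts, use a real tokenizer for the model in question.

```python
import math


def estimate_tokens(text: str) -> int:
    """Rough token estimate for English text.

    Combines the ~4 characters per token and ~0.75 words per token rules.
    This is a heuristic, not a tokenizer; real counts depend on the model.
    """
    if not text.strip():
        return 0
    by_chars = len(text) / 4             # ~4 characters per token
    by_words = len(text.split()) / 0.75  # ~0.75 words per token
    # Average the two estimates and round up to a whole token.
    return math.ceil((by_chars + by_words) / 2)
```

Code, non-English text, and unusual formatting can deviate a lot from these averages, so treat the result as a ballpark figure.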
Why Tokens Matter
AI API costs are based on tokens, not words or characters. Understanding token counts helps you:
- Estimate API costs before making calls
- Optimize prompts to reduce expenses
- Stay within model context limits (e.g., GPT-4o's 128K-token context window)
- Compare costs across different providers
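A context-limit check can be as small as one comparison. The sketch below assumes the prompt and the generated reply share a single context window, which is a simplification because some models also cap output separately. The function name and the 128,000 default are illustrative.

```python
def fits_context(input_tokens: int, max_output_tokens: int,
                 context_limit: int = 128_000) -> bool:
    """Return True if the prompt plus the reserved output budget fits.

    Assumes input and output tokens count against one shared window.
    """
    return input_tokens + max_output_tokens <= context_limit
```

For example, a 120,000-token prompt with 4,000 tokens reserved for the reply fits in a 128K window, while a 126,000-token prompt with the same reserve does not.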
AI Model Pricing Reference (January 2026)
Current pricing per 1,000 tokens (1K tokens ≈ 750 words):
🟢 OpenAI
| Model | Input | Output |
|---|---|---|
| GPT-4.5 Preview | \$0.075 | \$0.15 |
| GPT-4o | \$0.0025 | \$0.01 |
| GPT-4o Mini | \$0.00015 | \$0.0006 |
| o3-mini | \$0.0011 | \$0.0044 |
| o1 | \$0.015 | \$0.06 |
| o1-pro | \$0.15 | \$0.60 |
🟣 Anthropic
| Model | Input | Output |
|---|---|---|
| Claude 3.5 Sonnet | \$0.003 | \$0.015 |
| Claude 3.5 Haiku | \$0.0008 | \$0.004 |
| Claude 3 Opus | \$0.015 | \$0.075 |
| Claude 3 Haiku | \$0.00025 | \$0.00125 |
🔵 Google
| Model | Input | Output |
|---|---|---|
| Gemini 2.0 Flash | \$0.0001 | \$0.0004 |
| Gemini 2.0 Flash Lite | \$0.000075 | \$0.0003 |
| Gemini 1.5 Pro | \$0.00125 | \$0.005 |
| Gemini 1.5 Flash | \$0.000075 | \$0.0003 |
🟠 DeepSeek
| Model | Input | Output |
|---|---|---|
| DeepSeek V3 | \$0.00014 | \$0.00028 |
| DeepSeek R1 | \$0.00055 | \$0.00219 |
| R1 Distill 70B | \$0.00023 | \$0.00069 |
🦙 Meta Llama
| Model | Input | Output |
|---|---|---|
| Llama 3.3 70B | \$0.0008 | \$0.0008 |
| Llama 3.2 90B Vision | \$0.0012 | \$0.0012 |
| Llama 3.1 405B | \$0.003 | \$0.003 |
| Llama 3.1 8B | \$0.0001 | \$0.0001 |
🔷 Others
| Model | Input | Output |
|---|---|---|
| Mistral Large | \$0.002 | \$0.006 |
| Grok 2 | \$0.002 | \$0.01 |
| Command R+ | \$0.0025 | \$0.01 |
| Sonar Pro | \$0.003 | \$0.015 |
Prices as of January 2026. Pricing changes frequently — always verify with official provider documentation. DeepSeek and Llama pricing varies by inference provider (Together, Groq, Fireworks, etc.).
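Many provider pricing pages quote prices per 1 million tokens rather than per 1,000, so the numbers need converting when you verify them against the tables here. A small sketch of the conversion follows. The helper names are made up for this example.

```python
def per_1k_to_per_1m(price_per_1k: float) -> float:
    """Convert a USD price per 1,000 tokens to a price per 1,000,000 tokens."""
    return price_per_1k * 1000


def per_1m_to_per_1k(price_per_1m: float) -> float:
    """Convert a USD price per 1,000,000 tokens to a price per 1,000 tokens."""
    return price_per_1m / 1000
```

The \$0.0025 per 1K input price listed for GPT-4o corresponds to \$2.50 per 1M tokens.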
Tips to Reduce Token Usage
Be Concise
Remove filler words, redundant instructions, and unnecessary context. "Summarize this:" usually works as well as "Please summarize the following text for me:".
Use JSON for Data
Structured data in JSON format often uses fewer tokens than verbose natural language descriptions.
Reuse System Prompts
System prompts count as tokens on every request. Keep them minimal and move dynamic content to user messages.
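Because the system prompt is billed again on every call, its cost scales with request volume. The sketch below estimates that overhead. The function name and the example numbers are hypothetical, and the calculation ignores any caching discounts a provider may offer.

```python
def system_prompt_overhead(system_tokens: int, requests: int,
                           input_price_per_1k: float) -> float:
    """Return the USD spent resending the system prompt across all requests."""
    total_tokens = system_tokens * requests
    return total_tokens / 1000 * input_price_per_1k
```

At the \$0.0025 per 1K input price listed for GPT-4o, a 500-token system prompt sent with 10,000 requests costs \$12.50. Trimming it to 200 tokens brings that down to \$5.00.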
Limit Output
Use the max_tokens parameter to cap response length. Output tokens usually cost more than input tokens: in the tables above, the ratio ranges from 1x (Llama) to 5x (Claude), with GPT-4o at 4x.
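Capping output also puts a ceiling on what a single reply can cost. The sketch below computes that ceiling and the output-to-input price ratio for one model. Both helper names are illustrative, and the prices are the per-1K figures from the tables on this page.

```python
def max_output_cost(max_tokens: int, output_price_per_1k: float) -> float:
    """Return the worst-case USD cost of a reply capped at max_tokens."""
    return max_tokens / 1000 * output_price_per_1k


def output_to_input_ratio(input_price_per_1k: float,
                          output_price_per_1k: float) -> float:
    """Return how many times more an output token costs than an input token."""
    return output_price_per_1k / input_price_per_1k
```

With GPT-4o's listed prices, a reply capped at 300 tokens costs at most \$0.003, and each output token costs 4 times as much as an input token.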
Frequently Asked Questions
How much does GPT-4o cost per token?
GPT-4o costs \$0.0025 per 1,000 input tokens and \$0.01 per 1,000 output tokens. GPT-4o Mini is much cheaper at \$0.00015 input and \$0.0006 output per 1K tokens. The new GPT-4.5 Preview is \$0.075/\$0.15 per 1K tokens.
What's the cheapest AI model right now?
For quality-to-cost ratio, DeepSeek V3 (\$0.00014/\$0.00028 per 1K tokens) and Gemini 1.5 Flash 8B (\$0.0000375/\$0.00015 per 1K tokens) stand out. For simple tasks, Llama 3.2 1B is nearly free at about \$0.00004 per 1K tokens, though the exact price depends on the inference provider.
What is DeepSeek R1?
DeepSeek R1 is a reasoning model similar to OpenAI's o1, released in January 2025. It's open-source and extremely cost-effective at \$0.00055 input and \$0.00219 output per 1K tokens — roughly 27x cheaper than o1 for similar reasoning capabilities.
What's the difference between o1 and o3-mini?
o3-mini is an OpenAI reasoning model released in January 2025, designed to be faster and cheaper than o1 while maintaining strong reasoning. o3-mini costs \$0.0011/\$0.0044 per 1K tokens compared to o1's \$0.015/\$0.06.
Is Claude 3.5 Sonnet better than GPT-4o?
They're comparable in capability. Claude 3.5 Sonnet (\$0.003/\$0.015) is slightly more expensive than GPT-4o (\$0.0025/\$0.01) but excels at longer outputs and coding tasks. Many developers use both depending on the task.