Free AI Token Counter & Cost Calculator

Count tokens and calculate API costs for GPT-4, Claude, Gemini, and more. Real-time pricing for all major LLM providers.



How to Use This Token Counter

  1. Paste your text — Enter your prompt, system message, or any text you want to analyze in the input box above.
  2. View token count — The tool instantly shows how many tokens your text contains using OpenAI's tokenization algorithm.
  3. Select your model — Choose the AI model you plan to use from the dropdown to see accurate pricing.
  4. Set expected output — Enter how many tokens you expect the AI to generate in response for complete cost estimation.
  5. Compare models — Scroll down to see cost comparison across all major providers to find the best value.
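The cost math behind steps 3 and 4 is simple: divide each token count by 1,000 and multiply by the per-1K price. A minimal sketch, using the GPT-4o rates from the pricing reference on this page (the function name `estimate_cost` is just for illustration):

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_1k: float, output_price_per_1k: float) -> float:
    """Estimate a single request's cost from token counts and per-1K prices."""
    return (input_tokens / 1000) * input_price_per_1k \
         + (output_tokens / 1000) * output_price_per_1k

# GPT-4o: $0.0025 per 1K input tokens, $0.01 per 1K output tokens
per_request = estimate_cost(1200, 500, 0.0025, 0.01)
print(f"${per_request:.4f} per request")            # 1.2*0.0025 + 0.5*0.01 = $0.0080
print(f"${per_request * 1000:.2f} per 1,000 requests")
```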

Understanding AI Tokens

What is a Token?

A token is the basic unit of text that AI models process. Think of it as a piece of a word. In English, one token is roughly:

  • ~4 characters on average
  • ~0.75 words (so 1 word ≈ 1.33 tokens)
  • For code: variable names, operators, and syntax are separate tokens
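Exact counts require the model's own tokenizer (OpenAI publishes theirs as the tiktoken library), but the rules of thumb above can be turned into a quick estimator. A sketch under that assumption (the `rough_token_estimate` helper is hypothetical, not part of any API):

```python
def rough_token_estimate(text: str) -> int:
    """Approximate token count using the ~4-characters-per-token rule of thumb
    for English text. For exact counts, use the model's own tokenizer."""
    return max(1, round(len(text) / 4))

print(rough_token_estimate("Hello, world!"))  # 13 chars -> estimate of 3 (actual: 4)
```

The estimate skews low for punctuation-heavy text and code, where short symbols often tokenize separately.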

Token Examples

"Hello" = 1 token
"Hello, world!" = 4 tokens
"Supercalifragilistic" = 7 tokens
"你好" = 2 tokens

Why Tokens Matter

AI API costs are based on tokens, not words or characters. Understanding token counts helps you:

  • Estimate API costs before making calls
  • Optimize prompts to reduce expenses
  • Stay within model context limits (e.g., GPT-4o's 128K limit)
  • Compare costs across different providers
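A context-limit check follows directly from the points above: the prompt plus the room reserved for the response must fit in the model's window. A minimal sketch assuming a 128K window, as for GPT-4o (the function name `fits_in_context` is illustrative):

```python
def fits_in_context(prompt_tokens: int, max_output_tokens: int,
                    context_limit: int = 128_000) -> bool:
    """Check that the prompt plus reserved output stays within the context window."""
    return prompt_tokens + max_output_tokens <= context_limit

print(fits_in_context(120_000, 4_000))   # True: 124K fits in 128K
print(fits_in_context(127_000, 4_000))   # False: 131K would overflow
```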

AI Model Pricing Reference (January 2026)

Current pricing per 1,000 tokens (1K tokens ≈ 750 words):

🟢 OpenAI

| Model | Input (per 1K) | Output (per 1K) |
|-------|----------------|-----------------|
| GPT-4.5 Preview | \$0.075 | \$0.15 |
| GPT-4o | \$0.0025 | \$0.01 |
| GPT-4o Mini | \$0.00015 | \$0.0006 |
| o3-mini | \$0.0011 | \$0.0044 |
| o1 | \$0.015 | \$0.06 |
| o1-pro | \$0.15 | \$0.60 |

🟣 Anthropic

| Model | Input (per 1K) | Output (per 1K) |
|-------|----------------|-----------------|
| Claude 3.5 Sonnet | \$0.003 | \$0.015 |
| Claude 3.5 Haiku | \$0.0008 | \$0.004 |
| Claude 3 Opus | \$0.015 | \$0.075 |
| Claude 3 Haiku | \$0.00025 | \$0.00125 |

🔵 Google

| Model | Input (per 1K) | Output (per 1K) |
|-------|----------------|-----------------|
| Gemini 2.0 Flash | \$0.0001 | \$0.0004 |
| Gemini 2.0 Flash Lite | \$0.000075 | \$0.0003 |
| Gemini 1.5 Pro | \$0.00125 | \$0.005 |
| Gemini 1.5 Flash | \$0.000075 | \$0.0003 |

🟠 DeepSeek

| Model | Input (per 1K) | Output (per 1K) |
|-------|----------------|-----------------|
| DeepSeek V3 | \$0.00014 | \$0.00028 |
| DeepSeek R1 | \$0.00055 | \$0.00219 |
| R1 Distill 70B | \$0.00023 | \$0.00069 |

🦙 Meta Llama

| Model | Input (per 1K) | Output (per 1K) |
|-------|----------------|-----------------|
| Llama 3.3 70B | \$0.0008 | \$0.0008 |
| Llama 3.2 90B Vision | \$0.0012 | \$0.0012 |
| Llama 3.1 405B | \$0.003 | \$0.003 |
| Llama 3.1 8B | \$0.0001 | \$0.0001 |

🔷 Others

| Model | Input (per 1K) | Output (per 1K) |
|-------|----------------|-----------------|
| Mistral Large | \$0.002 | \$0.006 |
| Grok 2 | \$0.002 | \$0.01 |
| Command R+ | \$0.0025 | \$0.01 |
| Sonar Pro | \$0.003 | \$0.015 |

Prices as of January 2026. Pricing changes frequently — always verify with official provider documentation. DeepSeek and Llama pricing varies by inference provider (Together, Groq, Fireworks, etc.).
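The per-model tables above can be folded into a small comparison script. The prices below are copied from the reference tables on this page; as noted, verify them against provider documentation before relying on the result:

```python
# Per-1K-token prices (input, output) from the pricing reference above.
PRICES = {
    "GPT-4o":            (0.0025,  0.01),
    "GPT-4o Mini":       (0.00015, 0.0006),
    "Claude 3.5 Sonnet": (0.003,   0.015),
    "Gemini 2.0 Flash":  (0.0001,  0.0004),
    "DeepSeek V3":       (0.00014, 0.00028),
}

def compare(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, total request cost) pairs sorted from cheapest to priciest."""
    costs = {
        model: (input_tokens / 1000) * inp + (output_tokens / 1000) * out
        for model, (inp, out) in PRICES.items()
    }
    return sorted(costs.items(), key=lambda kv: kv[1])

# A 2,000-token prompt with a 500-token response:
for model, cost in compare(2000, 500):
    print(f"{model:<18} ${cost:.5f}")
```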

Tips to Reduce Token Usage

✂️ Be Concise

Remove filler words, redundant instructions, and unnecessary context. "Summarize this:" works as well as "Please summarize the following text for me:"

📝 Use JSON for Data

Structured data in JSON format often uses fewer tokens than verbose natural language descriptions.
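A rough way to see this is to compare character counts, which loosely track token counts (the example strings here are made up for illustration):

```python
import json

verbose = ("The user's name is Alice, the user's age is 30, "
           "and the user's city is Paris.")
compact = json.dumps({"name": "Alice", "age": 30, "city": "Paris"},
                     separators=(",", ":"))

print(len(verbose), "chars vs", len(compact), "chars")
# Fewer characters for the same facts generally means fewer tokens.
```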

🔄 Reuse System Prompts

System prompts count as tokens on every request. Keep them minimal and move dynamic content to user messages.

📊 Limit Output

Use the max_tokens parameter to cap response length. Output tokens typically cost 2-5x more than input tokens.
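Capping output bounds the worst-case spend per request. A quick calculation using the GPT-4o output rate from the pricing reference (the helper name `max_output_cost` is illustrative):

```python
OUTPUT_PRICE_PER_1K = 0.01  # GPT-4o output rate from the pricing reference

def max_output_cost(max_tokens: int) -> float:
    """Worst-case output cost when a response is capped at max_tokens."""
    return (max_tokens / 1000) * OUTPUT_PRICE_PER_1K

print(f"capped at 2000 tokens: ${max_output_cost(2000):.3f}")  # $0.020
print(f"capped at  500 tokens: ${max_output_cost(500):.3f}")   # $0.005
```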

Frequently Asked Questions

How much does GPT-4o cost per token?

GPT-4o costs \$0.0025 per 1,000 input tokens and \$0.01 per 1,000 output tokens. GPT-4o Mini is much cheaper at \$0.00015 input and \$0.0006 output per 1K tokens. The new GPT-4.5 Preview is \$0.075/\$0.15 per 1K tokens.
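Those per-1K rates translate into per-request costs like so (a worked example with made-up token counts):

```python
# GPT-4o: $0.0025 per 1K input tokens, $0.01 per 1K output tokens
input_tokens, output_tokens = 2000, 500
cost = (input_tokens / 1000) * 0.0025 + (output_tokens / 1000) * 0.01
print(f"${cost:.4f}")  # $0.0100 for a 2,000-in / 500-out request
```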

What's the cheapest AI model right now?

For quality-to-cost ratio, DeepSeek V3 (\$0.00014/\$0.00028 per 1K tokens) and Gemini 1.5 Flash 8B (\$0.0000375/\$0.00015 per 1K tokens) are incredibly cheap. For simple tasks, Llama 3.2 1B is nearly free at \$0.00004 per 1K tokens via most providers.

What is DeepSeek R1?

DeepSeek R1 is a reasoning model similar to OpenAI's o1, released in January 2025. It's open-source and extremely cost-effective at \$0.00055 input and \$0.00219 output per 1K tokens — roughly 27x cheaper than o1 for similar reasoning capabilities.

What's the difference between o1 and o3-mini?

o3-mini is OpenAI's latest reasoning model (January 2025), designed to be faster and cheaper than o1 while maintaining strong reasoning. o3-mini costs \$0.0011/\$0.0044 per 1K tokens compared to o1's \$0.015/\$0.06.

Is Claude 3.5 Sonnet better than GPT-4o?

They're comparable in capability. Claude 3.5 Sonnet (\$0.003/\$0.015) is slightly more expensive than GPT-4o (\$0.0025/\$0.01) but excels at longer outputs and coding tasks. Many developers use both depending on the task.
