Large language model (LLM) API pricing is based on tokens — sub-word units roughly equivalent to 4 characters or 0.75 words in English. As of 2026, prices range from $0.07/million tokens (Gemini 2.0 Flash input) to $75/million tokens (GPT-4.5 output), a 1,000x spread. Input tokens (your prompt) are typically 2-10x cheaper than output tokens (the model's response): input tokens are processed in parallel in a single forward pass, while each output token is generated sequentially and requires its own forward pass through the model.
How Token Pricing Works
LLM APIs charge per token, with separate rates for input (your prompt) and output (the model's response). Prices are quoted per million tokens, so the total cost of a request depends on three factors: input token count, output token count, and the model's per-token rates. Concretely: cost = (input_tokens × input_rate + output_tokens × output_rate) ÷ 1,000,000.
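The formula above can be sketched as a small helper. The rates in the example call ($0.15 input / $0.60 output per million tokens) are illustrative placeholder numbers, not a quote for any specific model:

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_rate_per_m: float, output_rate_per_m: float) -> float:
    """Return the cost in dollars of a single request.

    Rates are expressed in dollars per million tokens, matching how
    providers publish their pricing.
    """
    return (input_tokens * input_rate_per_m
            + output_tokens * output_rate_per_m) / 1_000_000


# Illustrative rates only: a 2,000-token prompt with a 500-token response
# at $0.15/M input and $0.60/M output.
cost = estimate_cost(2000, 500, 0.15, 0.60)
print(f"${cost:.4f}")  # → $0.0006
```

Note that even though the response is a quarter the length of the prompt here, it accounts for half the cost — a direct consequence of the higher output rate.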
How do you choose the right LLM model for your budget?
For high-volume, low-complexity tasks (classification, extraction, simple Q&A), budget models like GPT-4o mini, Gemini Flash, or Mistral Small offer excellent cost-per-quality ratios. For complex reasoning, coding, or creative tasks, mid-tier models like Claude Sonnet or GPT-4o provide the best balance. Reserve premium models (Claude Opus, GPT-4.5) for tasks where quality is non-negotiable and volume is low.
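To make the tier trade-off concrete, the sketch below compares per-1,000-request costs across three hypothetical tiers. The tier names and rates are illustrative assumptions chosen to reflect the rough budget/mid/premium spread described above, not published prices for any named model:

```python
# Illustrative (input_rate, output_rate) in dollars per million tokens.
# These are assumed numbers for comparison, not real provider pricing.
TIERS = {
    "budget":  (0.15, 0.60),
    "mid":     (3.00, 15.00),
    "premium": (15.00, 75.00),
}

def cost_per_1k_requests(input_tokens: int, output_tokens: int,
                         rates: tuple[float, float]) -> float:
    """Cost in dollars of 1,000 requests of the given shape."""
    input_rate, output_rate = rates
    per_request = (input_tokens * input_rate
                   + output_tokens * output_rate) / 1_000_000
    return per_request * 1000


# A typical request: 1,500-token prompt, 400-token response.
for tier, rates in TIERS.items():
    print(f"{tier:>8}: ${cost_per_1k_requests(1500, 400, rates):.2f} per 1k requests")
# →   budget: $0.47 per 1k requests
# →      mid: $10.50 per 1k requests
# →  premium: $52.50 per 1k requests
```

At these assumed rates the premium tier costs over 100x the budget tier for identical traffic, which is why routing high-volume, low-complexity tasks to cheaper models dominates most cost-optimization strategies.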