ABCDEFGHIJKLMNOPQRSTUVWXYZ
GLOSSARY

Token

DEFINITION

The basic unit of text that language models process. Tokens are roughly 3-4 characters or 0.75 words in English. LLM pricing and context window limits are measured in tokens.

Language models don't process text character by character or word by word — they use a tokenizer to split text into tokens. "hamburger" might be one token, while "antidisestablishmentarianism" might be five. Spaces, punctuation, and special characters are often their own tokens.

Understanding tokens is critical for: (1) Estimating API costs — Claude Sonnet costs $3/M input tokens + $15/M output tokens. (2) Managing context windows — 200K tokens ≈ 150,000 words ≈ 500 pages of text. (3) Debugging truncation issues when prompts get cut off.

Rule of thumb: 1,000 tokens ≈ 750 words. A typical business email is ~300 tokens; a 10-page report is ~4,000 tokens; an entire book is ~75,000 tokens.

Tools That Use Token

C
Claude
9.4/10

Anthropic's AI assistant with industry-leading reasoning and safety

Free / $20/mo Pro / API from $3/M tokensView Review →
C
ChatGPT
8.8/10

OpenAI's AI assistant powering 100M+ users worldwide

Free / $20/mo Plus / API from $0.15/M tokensView Review →