Question 1

How do I count GPT-4o tokens?

Accepted Answer

Install tokcount with `npm i -g @v0idd0/tokcount`, then run `tokcount prompt.md --model gpt-4o`. tokcount uses the o200k_base encoding, the same encoding the official OpenAI tiktoken library uses for GPT-4o, and reports the exact token count plus the cost at current OpenAI API pricing.

Question 2

Does tokcount give the same token count as tiktoken for GPT-4o?

Accepted Answer

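Yes. tokcount uses gpt-tokenizer, a JavaScript port of tiktoken that produces the same token sequences as tiktoken for GPT-3.5, GPT-4, and GPT-4o using the cl100k_base and o200k_base encodings. For a given text, the count matches what tiktoken reports. Billed input for chat requests can be slightly higher, because the API adds a few formatting tokens per message.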

Question 3

What is GPT-4o's context window?

Accepted Answer

GPT-4o has a 128,000-token context window. GPT-4o mini has a 128,000-token context window and can generate up to 16,384 output tokens. o3 has a 200,000-token context window. tokcount shows the percentage of the context window used alongside the token count.
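The percentage figure is simple arithmetic: token count divided by the model's context window. The sketch below illustrates that calculation using the window sizes quoted above. The function name and the model keys are hypothetical and are not taken from tokcount's source.

```python
# Context window sizes in tokens, as quoted in the answer above.
# The dictionary keys are illustrative model identifiers (an assumption).
CONTEXT_WINDOWS = {
    "gpt-4o": 128_000,
    "gpt-4o-mini": 128_000,
    "o3": 200_000,
}


def context_usage_percent(token_count: int, model: str) -> float:
    """Return the share of the model's context window used, in percent."""
    if token_count < 0:
        raise ValueError("token_count must be non-negative")
    window = CONTEXT_WINDOWS[model]
    return 100.0 * token_count / window


# A 32,000-token prompt fills a quarter of GPT-4o's window.
print(f"{context_usage_percent(32_000, 'gpt-4o'):.1f}%")  # 25.0%
```

A result above 100 means the prompt alone does not fit, before any output tokens are generated.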

Question 4

How much does GPT-4o cost per token?

Accepted Answer

As of mid-2026, GPT-4o costs $2.50 per 1M input tokens and $10.00 per 1M output tokens. GPT-4o mini costs $0.15 per 1M input tokens and $0.60 per 1M output tokens. o3 costs $10.00 per 1M input tokens and $40.00 per 1M output tokens. o4-mini costs $1.10 per 1M input tokens and $4.40 per 1M output tokens. Cached input tokens are billed at a 50% discount.
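To turn these per-million prices into a cost for a single request, multiply each token count by its rate and divide by 1,000,000. The following is a minimal sketch of that arithmetic using the prices quoted above. The function name and model keys are hypothetical, and it assumes cached tokens are a subset of the input tokens and that the 50% discount applies uniformly to every model listed.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the answer above.
PRICES_PER_MILLION = {
    "gpt-4o": (Decimal("2.50"), Decimal("10.00")),
    "gpt-4o-mini": (Decimal("0.15"), Decimal("0.60")),
    "o3": (Decimal("10.00"), Decimal("40.00")),
    "o4-mini": (Decimal("1.10"), Decimal("4.40")),
}
CACHED_INPUT_DISCOUNT = Decimal("0.50")  # assumption: flat 50% for all models
MILLION = Decimal(1_000_000)


def estimate_cost(model: str, input_tokens: int, output_tokens: int,
                  cached_input_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request.

    cached_input_tokens is treated as a subset of input_tokens.
    """
    if not 0 <= cached_input_tokens <= input_tokens:
        raise ValueError("cached_input_tokens must be between 0 and input_tokens")
    input_price, output_price = PRICES_PER_MILLION[model]
    uncached = input_tokens - cached_input_tokens
    total = (
        uncached * input_price
        + cached_input_tokens * input_price * (1 - CACHED_INPUT_DISCOUNT)
        + output_tokens * output_price
    )
    return total / MILLION


# 10,000 input tokens and 1,000 output tokens on GPT-4o.
print(f"${estimate_cost('gpt-4o', 10_000, 1_000):.4f}")  # $0.0350
```

Decimal is used instead of float so that small per-token amounts add up without binary rounding noise.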

Question 5

What encoding does GPT-4o use?

Accepted Answer

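GPT-4o uses the o200k_base encoding, a byte pair encoding (BPE) vocabulary of roughly 200,000 tokens. GPT-3.5 and GPT-4 use cl100k_base, which has roughly 100,000 tokens. Both rely on the same underlying BPE algorithm but have different vocabularies, so the same text can produce different token counts. tokcount automatically selects the correct encoding for each model.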
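The effect of sharing an algorithm but not a vocabulary can be seen in a toy example. The sketch below is a deliberately simplified character-level BPE encoder. The two merge tables are invented for illustration and have nothing to do with the real cl100k_base or o200k_base data, and real tokenizers work on bytes with regex pre-splitting, which is omitted here.

```python
def bpe_encode(text: str, merges: list[tuple[str, str]]) -> list[str]:
    """Toy BPE: start from single characters, then apply merges in priority order."""
    tokens = list(text)
    for left, right in merges:
        merged = []
        i = 0
        while i < len(tokens):
            # Fuse an adjacent (left, right) pair into one token.
            if i + 1 < len(tokens) and tokens[i] == left and tokens[i + 1] == right:
                merged.append(left + right)
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return tokens


# Hypothetical merge tables: the larger one extends the smaller one.
SMALL_VOCAB_MERGES = [("l", "o"), ("lo", "w")]
LARGE_VOCAB_MERGES = SMALL_VOCAB_MERGES + [("e", "r"), ("low", "er")]

print(bpe_encode("lower", SMALL_VOCAB_MERGES))  # ['low', 'e', 'r']
print(bpe_encode("lower", LARGE_VOCAB_MERGES))  # ['lower']
```

The same function yields three tokens with the small table and one with the large table, which is why a token count is only meaningful together with the encoding that produced it.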
