LLM Token Counter
Paste prompts, documents, or transcripts to count GPT-4o tokens exactly and estimate Claude, Llama 3, Gemini, and Mistral usage with cost and token-per-word metrics.
LLM Token Counter Use Cases
- Estimate prompt size before calling an LLM API
- Compare token cost across GPT-4o, Claude, Llama, Gemini, and Mistral
- Check whether long documents fit into a context window (see the sketch after this list)
- Measure token-per-word density for prompts, datasets, and transcripts
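A minimal sketch of the estimation and context-window check, in TypeScript. The tokens-per-word ratios, model names, and the reserved-output headroom are illustrative placeholders, not the tool's calibrated values.

```typescript
// Hypothetical calibration ratios (tokens per word); the tool's real
// per-model values are not published here, these are illustrative only.
const TOKENS_PER_WORD: Record<string, number> = {
  "gpt-4o": 1.3,
  "claude": 1.35,
  "llama-3": 1.4,
  "gemini": 1.3,
  "mistral": 1.4,
};

// Rough prompt-size estimate from a word count and a calibration ratio.
function estimateTokens(text: string, model: string): number {
  const words = text.trim().split(/\s+/).filter(Boolean).length;
  return Math.ceil(words * (TOKENS_PER_WORD[model] ?? 1.35));
}

// Check whether an estimated prompt fits a model's context window,
// leaving headroom for the completion.
function fitsContextWindow(
  estimatedTokens: number,
  contextWindow: number,
  reservedForOutput = 1024
): boolean {
  return estimatedTokens + reservedForOutput <= contextWindow;
}
```

For example, `fitsContextWindow(estimateTokens(doc, "llama-3"), 8192)` tells you whether an estimated prompt leaves room for a completion in an 8K-token window.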
LLM Token Counter FAQ
Is the token count exact for every model?
GPT-4o counts are exact: a real tokenizer is loaded on demand in the browser. Claude, Llama 3, Gemini, and Mistral use calibrated browser-side estimates, because their production tokenizers are not publicly available to run locally.
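A sketch of how on-demand exact counting can sit alongside the estimates. It assumes the js-tiktoken package and its getEncoding helper with the o200k_base encoding used by GPT-4o; the tool's actual tokenizer bundle and loader may differ.

```typescript
// Lazily load the exact GPT-4o tokenizer only when it is first needed,
// so the initial page load stays light. Assumes js-tiktoken; the tool's
// actual tokenizer bundle may differ.
let gpt4oEncoder: { encode(text: string): number[] } | null = null;

async function countGpt4oTokensExact(text: string): Promise<number> {
  if (!gpt4oEncoder) {
    const { getEncoding } = await import("js-tiktoken");
    gpt4oEncoder = getEncoding("o200k_base"); // encoding used by GPT-4o
  }
  return gpt4oEncoder.encode(text).length;
}

// Calibrated browser-side estimate for models whose production tokenizers
// are not available locally (the ratio is illustrative, not the tool's value).
function countTokensEstimated(text: string, tokensPerWord: number): number {
  const words = text.trim().split(/\s+/).filter(Boolean).length;
  return Math.ceil(words * tokensPerWord);
}
```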
Does my prompt leave the browser?
No. Text is processed locally in your browser, and large inputs move to a Web Worker so the interface stays responsive.
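A minimal sketch of the Web Worker hand-off for large inputs, using only standard browser APIs; the file name and message shape are illustrative, not the tool's actual code.

```typescript
// Main thread: hand large inputs to a worker so typing stays responsive.
// "token-worker.ts" and the message shape are illustrative names.
const worker = new Worker(new URL("./token-worker.ts", import.meta.url), {
  type: "module",
});

function countInWorker(text: string): Promise<number> {
  return new Promise((resolve) => {
    worker.onmessage = (event: MessageEvent<{ tokens: number }>) =>
      resolve(event.data.tokens);
    worker.postMessage({ text });
  });
}

// --- token-worker.ts (illustrative worker body, WebWorker typings assumed) ---
// All counting happens locally; nothing leaves the browser.
onmessage = (event: MessageEvent<{ text: string }>) => {
  const words = event.data.text.trim().split(/\s+/).filter(Boolean).length;
  // Placeholder: the real worker runs the exact tokenizer or the estimate.
  postMessage({ tokens: Math.ceil(words * 1.3) });
};
```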
How is cost estimated?
Cost is computed from the counted input tokens and the selected model profile's price per million input tokens. It is intended for planning and comparison, not billing reconciliation.
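A sketch of that formula, assuming a per-million input-token price on each model profile; the price used in the example is a placeholder, not a current list price.

```typescript
interface ModelProfile {
  name: string;
  inputPricePerMillionUsd: number; // placeholder, not a current list price
}

// Estimated input cost: tokens scaled by the profile's per-million price.
function estimateInputCostUsd(tokens: number, profile: ModelProfile): number {
  return (tokens / 1_000_000) * profile.inputPricePerMillionUsd;
}

// Example: 12,000 input tokens at a hypothetical $5 per million tokens.
const exampleCost = estimateInputCostUsd(12_000, {
  name: "gpt-4o",
  inputPricePerMillionUsd: 5,
}); // 0.06 USD
```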
Can I compare token density between models?
Yes. The comparison table shows token count, token-per-word ratio, and estimated input cost for each supported model family.
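A sketch of how one comparison row per model family could be assembled from the counts above; the inputs are illustrative, not the tool's internal data model.

```typescript
interface ComparisonRow {
  model: string;
  tokens: number;
  tokensPerWord: number;
  estimatedCostUsd: number;
}

// Build one comparison row from a token count, the input's word count,
// and a per-million input price (all values illustrative).
function buildRow(
  model: string,
  tokens: number,
  wordCount: number,
  pricePerMillionUsd: number
): ComparisonRow {
  return {
    model,
    tokens,
    tokensPerWord: wordCount > 0 ? tokens / wordCount : 0,
    estimatedCostUsd: (tokens / 1_000_000) * pricePerMillionUsd,
  };
}
```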