Question 1

How much does GPT-4o cost per 1000 requests?

Accepted Answer

GPT-4o costs $5 per million input tokens and $15 per million output tokens. A typical support prompt with 500 input tokens and 200 output tokens costs about $0.0055 per call — or $55 per 10,000 requests.

Question 2

How many tokens is my prompt?

Accepted Answer

A rough estimate is 1 token ≈ 4 characters (or about ¾ of a word). Paste your prompt into the calculator above to get an exact estimate.

Question 3

How can I reduce my LLM API costs?

Accepted Answer

Fine-tuning a smaller open-source model (Llama 3, Mistral) on your specific task can reduce costs by 80–95% vs GPT-4. The fine-tuned model learns your use case and runs at a fraction of the price.

Question 4

What is the difference between input and output tokens?

Accepted Answer

Input tokens are what you send to the model (your system prompt + user message). Output tokens are what the model generates in response. Both are billed separately — output tokens usually cost 3-4x more than input tokens.

Paste Your Prompt. See Real Costs.

Cost at 10K requests / month

Frequently Asked Questions

How much does GPT-4o cost per 1,000 requests?

How many tokens is my prompt?

How can I reduce my LLM API costs by 80%?

What is the difference between input and output tokens?

Does token count vary between GPT-4, Claude, and Gemini?

When does fine-tuning beat prompt engineering?