Templates

Article generation
Long-form writing with moderately sized prompts and substantial output.
4k input · 2.5k output · 1.5k requests/day · 10% cache · Batch off

Customer support
Short multi-turn support replies with high daily traffic and reusable system prompts.
1.8k input · 600 output · 120k requests/day · 35% cache · Batch off

RAG Q&A
Retrieval-heavy prompts where context dominates cost and cache can matter.
6k input · 800 output · 30k requests/day · 45% cache · Batch off

Code completion
Interactive coding assistance with medium prompts and short outputs.
2.5k input · 350 output · 50k requests/day · 20% cache · Batch off

Batch summarization
Offline bulk summarization where batch mode and cache reuse are both realistic.
12k input · 1.6k output · 8k requests/day · 50% cache · Batch on

Workload assumptions

Display currency

Common currencies are listed first so the workload inputs stay in focus. Open the full currency search only when you need a less common display currency. The default display is USD, and 164 currencies are supported.

Live FX rates come from Frankfurter. If conversion is unavailable, the calculator falls back to the model source currency and marks that row.
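The fallback rule above can be sketched as a small pure function. This is only an illustration: the `DisplayAmount` type, the `to_display` name, and the shape of the `rates` mapping are hypothetical, and the sketch takes rates as an argument instead of calling Frankfurter.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class DisplayAmount:
    amount: float
    currency: str
    is_fallback: bool  # True when the row stayed in the model source currency


def to_display(amount: float, source: str, target: str,
               rates: Optional[Mapping[str, float]]) -> DisplayAmount:
    """Convert `amount` from `source` to `target`, or fall back to `source`.

    `rates` maps a currency code to units of that currency per 1 unit of
    `source` (assumed shape). None, a missing code, or a non-positive rate
    is treated as "conversion unavailable".
    """
    if source == target:
        return DisplayAmount(amount, source, False)
    rate = rates.get(target) if rates else None
    if rate is None or rate <= 0:
        # Keep the source currency and mark the row as a fallback.
        return DisplayAmount(amount, source, True)
    return DisplayAmount(round(amount * rate, 2), target, False)


# 0.5 is a placeholder rate for illustration, not a real quote.
converted = to_display(144.60, "USD", "EUR", {"EUR": 0.5})
unavailable = to_display(144.60, "USD", "EUR", None)
```

Returning the fallback flag alongside the amount lets the caller mark the affected row without a second lookup.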

Models to compare

3 selected. Leave this blank and the calculator falls back to three live-snapshot defaults.

Quick token estimate

Optional helper for rough sizing before you set request token numbers.


Current workload summary

Scenario template: Custom workload (directly edited inputs with no template preset)
Selected models: 3
Display currency: USD
Monthly request volume: 300,000
Request shape: 2,000 in · 1,000 out
Cache and batch: 20% cache · Batch off
Budget: Not set
FX mode: Source currency
Lowest monthly cost: GPT-5 Nano (OpenAI) · $144.60
Annualized estimate: $1735.20 (12 months at the same workload assumptions)
Budget view: Budget not comparable. Add a monthly budget to see whether the workload fits and how many requests each model can support.
Best savings lever: $5.40 (combined monthly savings from cache and batch on the current winner)

Request details

{
  "modelCodes": ["gpt-5-nano", "gpt-5-mini", "gpt-5"],
  "inputTokens": 2000,
  "outputTokens": 1000,
  "dailyRequests": 10000,
  "activeDays": 30,
  "cacheHitRatio": 0.20,
  "useBatch": false,
  "monthlyBudget": null,
  "displayCurrencyCode": "USD"
}
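
As a rough sketch, the payload above can be turned into the monthly volume shown in the summary. It assumes that monthly volume is `dailyRequests` multiplied by `activeDays`, which matches the displayed 300,000. The `monthly_volume` helper is hypothetical.

```python
import json

payload = json.loads("""
{
  "modelCodes": ["gpt-5-nano", "gpt-5-mini", "gpt-5"],
  "inputTokens": 2000,
  "outputTokens": 1000,
  "dailyRequests": 10000,
  "activeDays": 30,
  "cacheHitRatio": 0.20,
  "useBatch": false,
  "monthlyBudget": null,
  "displayCurrencyCode": "USD"
}
""")


def monthly_volume(req: dict) -> dict:
    """Derive monthly request and token totals from the request payload."""
    requests = req["dailyRequests"] * req["activeDays"]  # assumed derivation
    return {
        "monthly_requests": requests,
        "monthly_input_tokens": requests * req["inputTokens"],
        "monthly_output_tokens": requests * req["outputTokens"],
    }


volume = monthly_volume(payload)
# volume["monthly_requests"] == 300000
```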
From English words: 0
From Chinese chars: 0
From pages: 0
Estimated total tokens: 0

Estimate only. Default assumptions: 1 token ~= 0.75 English words, 1 token ~= 1.5 Chinese characters, 1 page ~= 500 English words.
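
The default assumptions above can be written out as a minimal sketch. The function name, the rounding, and the choice to sum the three sources into one total are assumptions. The page does not say how the calculator rounds or combines them.

```python
WORDS_PER_TOKEN = 0.75          # 1 token ~= 0.75 English words
CHINESE_CHARS_PER_TOKEN = 1.5   # 1 token ~= 1.5 Chinese characters
WORDS_PER_PAGE = 500            # 1 page ~= 500 English words


def estimate_tokens(english_words: int = 0, chinese_chars: int = 0,
                    pages: int = 0) -> dict:
    """Rough token estimate from words, Chinese characters, and pages."""
    from_words = round(english_words / WORDS_PER_TOKEN)
    from_chars = round(chinese_chars / CHINESE_CHARS_PER_TOKEN)
    # Pages are first converted to English words, then to tokens.
    from_pages = round(pages * WORDS_PER_PAGE / WORDS_PER_TOKEN)
    return {
        "from_english_words": from_words,
        "from_chinese_chars": from_chars,
        "from_pages": from_pages,
        "total": from_words + from_chars + from_pages,  # assumed: simple sum
    }


estimate = estimate_tokens(english_words=1500, chinese_chars=300, pages=3)
# estimate["total"] == 4200  (2000 + 200 + 2000)
```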

How the calculator interprets inputs

  • `Cache hit ratio` discounts only the cached share of input tokens.
  • `Batch discount` applies only when the stored snapshot lists a batch ratio for that model.
  • `Budget fit` means the maximum monthly requests you can afford with the exact request shape above.
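
The three rules above can be illustrated with a short sketch. The per-million rates below are not quoted by the page. They are inferred from the displayed GPT-5 Nano figures and should be treated as assumptions, as should the `Rates` type and the choice to apply the batch ratio as a multiplier on the whole request cost.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Rates:
    input_per_m: float          # price per 1M uncached input tokens
    cached_input_per_m: float   # price per 1M cached input tokens
    output_per_m: float         # price per 1M output tokens
    batch_ratio: Optional[float] = None  # None: snapshot lists no batch ratio


def cost_per_request(r: Rates, input_tokens: int, output_tokens: int,
                     cache_hit_ratio: float, use_batch: bool) -> float:
    # Only the cached share of input tokens gets the cached rate.
    cached = input_tokens * cache_hit_ratio
    fresh = input_tokens - cached
    cost = (fresh * r.input_per_m
            + cached * r.cached_input_per_m
            + output_tokens * r.output_per_m) / 1_000_000
    # The batch discount applies only when a ratio is listed for the model.
    if use_batch and r.batch_ratio is not None:
        cost *= r.batch_ratio  # assumed: ratio scales the whole request cost
    return cost


def budget_fit(monthly_budget: float, per_request: float) -> int:
    """Maximum whole requests affordable at the exact request shape."""
    return math.floor(monthly_budget / per_request)


# Inferred (not official) rates that reproduce the GPT-5 Nano row.
nano = Rates(input_per_m=0.05, cached_input_per_m=0.005, output_per_m=0.40)
per_request = cost_per_request(nano, 2000, 1000, 0.20, use_batch=False)
monthly_cost = per_request * 300_000        # about 144.60
fit = budget_fit(100.0, per_request)        # hypothetical $100 budget -> 207468
```

With no batch ratio on the model, setting `use_batch=True` leaves the cost unchanged, which matches the "No batch savings" note in the comparison.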

Share and export

Scenario, workload inputs, selected models, display currency, and budget stay in the current query string. You can share the exact state by URL or export the current estimate as CSV.
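
A minimal sketch of the sharing idea follows. The query parameter names are hypothetical, since the page does not list the real ones, and the CSV layout is only an example.

```python
import csv
import io
from urllib.parse import parse_qs, urlencode

# Hypothetical parameter names; the real query string keys are not shown.
state = {
    "models": "gpt-5-nano,gpt-5-mini,gpt-5",
    "inputTokens": 2000,
    "outputTokens": 1000,
    "dailyRequests": 10000,
    "activeDays": 30,
    "cacheHitRatio": 0.2,
    "useBatch": "false",
    "currency": "USD",
}


def to_query(s: dict) -> str:
    """Serialize calculator state into a shareable query string."""
    return urlencode(s)


def from_query(query: str) -> dict:
    """Parse a query string back into state; all values come back as strings."""
    return {key: values[0] for key, values in parse_qs(query).items()}


def to_csv(header: list, rows: list) -> str:
    """Render the estimate rows as CSV text."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(header)
    writer.writerows(rows)
    return buf.getvalue()


query = to_query(state)
restored = from_query(query)
csv_text = to_csv(["Model", "Monthly cost"], [["GPT-5 Nano", "144.60"]])
```

Because parsed values are strings, a real implementation would need to convert numeric fields back before recalculating.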

All three rows use a fallback snapshot from PricePerToken (provider OpenAI, currency USD, updated Apr 13, 22:05) because OpenAI official pricing pages currently return an anti-bot challenge to server-side crawlers.

GPT-5 Nano (gpt-5-nano, OpenAI)
Source updated at: 2026-04-12T09:23:55.468156Z
Monthly cost: $144.60
Annual cost: $1735.20
Delta vs winner: Current winner (lowest monthly cost in this scenario)
1k requests: $0.482
Blend / 1M: $0.138
Savings / month: $5.40 (cache + batch combined; cache saves $5.40, no batch savings)
Budget status: Set budget or use a comparable display currency
Budget fit: Set budget to see fit

GPT-5 Mini (gpt-5-mini, OpenAI)
Source updated at: 2026-04-12T09:23:55.468154Z
Monthly cost: $363.00
Annual cost: $4356.00
Delta vs winner: $218.40 more per month than the winner
1k requests: $1.21
Blend / 1M: $0.344
Savings / month: $12.00 (cache + batch combined; cache saves $12.00, no batch savings)
Budget status: Set budget or use a comparable display currency
Budget fit: Set budget to see fit

GPT-5 (gpt-5, OpenAI)
Source updated at: 2026-04-12T09:23:55.468149Z
Monthly cost: $1815.00
Annual cost: $21780.00
Delta vs winner: $1670.40 more per month than the winner
1k requests: $6.05
Blend / 1M: $1.72
Savings / month: $60.00 (cache + batch combined; cache saves $60.00, no batch savings)
Budget status: Set budget or use a comparable display currency
Budget fit: Set budget to see fit
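
Several columns in the comparison follow directly from the monthly cost and the 300,000 monthly requests. The sketch below reproduces the annual cost, the delta versus the winner, and the cost per 1k requests. The function name and row layout are illustrative, and the rounding is an assumption.

```python
MONTHLY_REQUESTS = 300_000

# Monthly costs as displayed in the comparison.
monthly_costs = {"GPT-5 Nano": 144.60, "GPT-5 Mini": 363.00, "GPT-5": 1815.00}


def comparison_rows(costs: dict, monthly_requests: int) -> list:
    """Derive annual cost, delta vs winner, and cost per 1k requests."""
    winner_cost = min(costs.values())
    rows = []
    # Cheapest model first, so the winner is always rows[0].
    for model, cost in sorted(costs.items(), key=lambda item: item[1]):
        rows.append({
            "model": model,
            "annual": round(cost * 12, 2),
            "delta_vs_winner": round(cost - winner_cost, 2),
            "per_1k_requests": round(cost / monthly_requests * 1000, 3),
        })
    return rows


rows = comparison_rows(monthly_costs, MONTHLY_REQUESTS)
# rows[0]: GPT-5 Nano, annual 1735.2, delta 0.0, per 1k 0.482
# rows[1]: GPT-5 Mini, annual 4356.0, delta 218.4, per 1k 1.21
# rows[2]: GPT-5,      annual 21780.0, delta 1670.4, per 1k 6.05
```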

Estimate only. Actual billing may differ by tokenizer behavior, cache hit rate, and provider rules.