AI API Cost Calculator Estimate monthly AI API costs Cheaper alternatives

AI API Cost Calculator

Estimate monthly AI API costs from provider, model, requests per day, average input tokens, average output tokens, cached ratio, and days per month. Start from the defaults to see daily, monthly, and yearly cost immediately.

View model list Need side-by-side comparison?

Estimate monthly AI API costs Default workload is already calculated Adjust provider, model, requests per day, average tokens, cached ratio, or days per month to replace the example with your own workload.

Use compare when The workload is fixed but the winning model is still unknown Compare is better for ranking several candidate models once your calculator inputs are stable.

If you searched for an AI API cost calculator, start here: the calculator turns token prices into business spend before showing price-table details.

Open compare with this scenario Review methodology

Template

Article generation

Load template

Long-form writing with moderately sized prompts and substantial output.

4k input · 2.5k output · 1.5k requests/day · 10% cache Batch off

Template

Customer support

Load template

Short multi-turn support replies with high daily traffic and reusable system prompts.

1.8k input · 600 output · 120k requests/day · 35% cache Batch off

Template

RAG Q&A

Load template

Retrieval-heavy prompts where context dominates cost and cache can matter.

6k input · 800 output · 30k requests/day · 45% cache Batch off

Template

Code completion

Load template

Interactive coding assistance with medium prompts and short outputs.

2.5k input · 350 output · 50k requests/day · 20% cache Batch off

Template

Batch summarization

Load template

Offline bulk summarization where batch mode and cache reuse are both realistic.

12k input · 1.6k output · 8k requests/day · 50% cache · batch on Batch on

Estimate monthly AI API costs

Display currency

Keep workload inputs in focus with common switches first. Open the full currency search only when you need a less common display.

USD Default USD display

Common currencies

Search more currencies 165 supported currencies

Search all currencies

All supported currencies

Live FX rates come from Frankfurter. If conversion is unavailable, the calculator falls back to the model source currency and marks that row.

Average input tokens Average output tokens Requests per day Days per month Cached ratio Monthly budget (optional) Apply batch discount where supported

Quick token estimate

Optional helper for rough sizing before you set request token numbers.

English words Chinese characters Pages

Current workload summary

Scenario template Custom workload Directly edited inputs with no template preset.

Selected models 1

Display currency USD

Monthly request volume 300,000

Request shape 2,000 in · 1,000 out

Cache and batch 20% cache · Batch off

Budget Not set

FX mode Source currency

Daily Cost $29.92 10,000 requests per day

Monthly Cost $897.60 30 days per month

Yearly Cost $10771.20 12 months at the same workload assumptions

Cost Breakdown

Input Cost $0.0005 Uncached input share per request after 20% cached ratio

Output Cost $0.0025 1,000 average output tokens

Cached Input Cost $0.0000 120,000,000 cached input tokens per month

Cached Input Savings $32.40 Monthly reduction from cached input pricing when the provider lists it

Gemini 2.5 Flash is the current estimate at USD 897.6 per month for this workload.

Cheaper Alternatives

No cheaper comparable option in the selected set Add more provider and model choices to compare the same workload against lower monthly cost estimates.

Request details

Show request details

{
  "modelCodes": ["gemini-2.5-flash"],
  "inputTokens": 2000,
  "outputTokens": 1000,
  "dailyRequests": 10000,
  "activeDays": 30,
  "cacheHitRatio": 0.20,
  "useBatch": false,
  "monthlyBudget": null,
  "displayCurrencyCode": "USD"
}

From English words 0

From Chinese chars 0

From pages 0

Estimated total tokens 0

Estimate only. Default assumptions: 1 token ~= 0.75 English words, 1 token ~= 1.5 Chinese characters, 1 page ~= 500 English words.

How the calculator interprets inputs

`Cached ratio` discounts only the cached share of input tokens.
`Batch discount` applies only when the stored snapshot lists a batch ratio for that model.
`Budget fit` means the maximum monthly requests you can afford with the exact request shape above.

Open glossary Methodology

Share and export

Scenario, workload inputs, selected models, display currency, and budget stay in the current query string. You can share this exact AI API Cost Calculator state by URL or export the current estimate as CSV.

Export CSV

Model	Daily cost	Monthly cost	Yearly cost	Delta vs winner	Cost breakdown	1k requests	Blend / 1M	Savings / month	Budget status	Budget fit
Gemini 2.5 Flash gemini-2.5-flash Gemini Updated Mar 30, 02:54 Official source Gemini Developer Currency USD standard input=$0.30 (text / image / video) \| $1.00 (audio); standard output=$2.50; cache=$0.03 (text / image / video) \| $0.1 (audio) \| $1.00 / 1,000,000 tokens per hour (storage price) Cache saves $32.40 No batch savings	$29.92	$897.60	$10771.20	Current winner Lowest monthly cost in this scenario	Input $0.0005 Output $0.0025 Cached input $0.0000	$2.99	$0.850	$32.40 cache + batch combined	Set budget or use a comparable display currency	Set budget to see fit

Estimate only. Actual billing may differ by tokenizer behavior, cache hit rate, and provider rules.

AI API Cost Calculator

Scenario templates

Article generation

Customer support

RAG Q&A

Code completion

Batch summarization

Estimate monthly AI API costs

Current workload summary

Default estimate result

Cost Breakdown

Cheaper Alternatives

Request details

Token estimate helper

How the calculator interprets inputs

Share and export

Estimated results