AI API Cost Calculator
Estimate monthly AI API costs from provider, model, requests per day, average input tokens, average output tokens, cached ratio, and days per month. Start from the defaults to see daily, monthly, and yearly cost immediately.
The calculator turns token prices into business spend first, then shows price-table details.
Scenario templates
Start from a realistic workload instead of a blank token form. Templates only preload values, so you can still edit everything after loading.
Article generation
Long-form writing with moderately sized prompts and substantial output.
Customer support
Short multi-turn support replies with high daily traffic and reusable system prompts.
RAG Q&A
Retrieval-heavy prompts where context dominates cost and cache can matter.
Code completion
Interactive coding assistance with medium prompts and short outputs.
Batch summarization
Offline bulk summarization where batch mode and cache reuse are both realistic.
Current workload summary
Default estimate result
Current winner: Gemini · Gemini 2.5 Flash-Lite
Cost Breakdown
Gemini 2.5 Flash-Lite is the lowest-cost estimate for this workload at USD 169.20 per month.
Cheaper Alternatives
Request details
{
  "modelCodes": ["gemini-2.5-flash-lite"],
  "inputTokens": 2000,
  "outputTokens": 1000,
  "dailyRequests": 10000,
  "activeDays": 30,
  "cacheHitRatio": 0.20,
  "useBatch": false,
  "monthlyBudget": null,
  "displayCurrencyCode": "USD"
}
Token estimate helper
Useful when you know document size before you know request tokens.
Estimate only. Default assumptions: 1 token ~= 0.75 English words, 1 token ~= 1.5 Chinese characters, 1 page ~= 500 English words.
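As a rough illustration, the default ratios above can be turned into a few conversion functions. This is a minimal sketch, not the calculator's own code: the function names are hypothetical, and rounding up to the next whole token is an assumption made here to keep estimates conservative.

```python
import math

# Default ratios quoted by the helper; real tokenizers vary by model and text.
WORDS_PER_TOKEN = 0.75     # 1 token ~= 0.75 English words
CHARS_PER_TOKEN_ZH = 1.5   # 1 token ~= 1.5 Chinese characters
WORDS_PER_PAGE = 500       # 1 page ~= 500 English words


def tokens_from_english_words(words: float) -> int:
    """Estimate tokens for English text, rounded up (assumption)."""
    return math.ceil(words / WORDS_PER_TOKEN)


def tokens_from_chinese_chars(chars: float) -> int:
    """Estimate tokens for Chinese text, rounded up (assumption)."""
    return math.ceil(chars / CHARS_PER_TOKEN_ZH)


def tokens_from_pages(pages: float) -> int:
    """Estimate tokens for English pages via the words-per-page default."""
    return tokens_from_english_words(pages * WORDS_PER_PAGE)


print(tokens_from_pages(3))             # 1500 words -> 2000 tokens
print(tokens_from_chinese_chars(3000))  # 2000 tokens
```

Under these defaults a three-page English document lands at about 2,000 input tokens, which matches the default request shape.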
How the calculator interprets inputs
- `Cached ratio` discounts only the cached share of input tokens.
- `Batch discount` applies only when the stored snapshot lists a batch ratio for that model.
- `Budget fit` is the maximum number of requests per month that your monthly budget covers at the exact request shape above.
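The three rules above can be sketched as a small cost model. This is an illustration under stated assumptions, not the calculator's implementation: the `Price` fields and function names are hypothetical, prices are taken as USD per 1M tokens, and the batch ratio is assumed to multiply the whole request cost.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class Price:
    """USD per 1M tokens. Field names are illustrative, not the calculator's schema."""
    input: float
    output: float
    cached_input: float
    batch_ratio: Optional[float] = None  # None = snapshot lists no batch ratio


def cost_per_request(price: Price, input_tokens: int, output_tokens: int,
                     cache_hit_ratio: float, use_batch: bool = False) -> float:
    cached = input_tokens * cache_hit_ratio  # only this share gets the cache price
    uncached = input_tokens - cached
    cost = (uncached * price.input
            + cached * price.cached_input
            + output_tokens * price.output) / 1_000_000
    # Assumption: the batch ratio scales the full request cost when it is listed.
    if use_batch and price.batch_ratio is not None:
        cost *= price.batch_ratio
    return cost


def monthly_cost(per_request: float, daily_requests: int, active_days: int) -> float:
    return per_request * daily_requests * active_days


def budget_fit(monthly_budget: float, per_request: float) -> int:
    """Whole requests per month that fit inside the budget."""
    return math.floor(monthly_budget / per_request)


flash_lite = Price(input=0.10, output=0.40, cached_input=0.01)
per_request = cost_per_request(flash_lite, 2000, 1000, cache_hit_ratio=0.20)
print(round(monthly_cost(per_request, 10_000, 30), 2))  # 169.2
print(budget_fit(500, per_request))                     # hypothetical USD 500 budget
```

With the default request shape this reproduces the USD 169.20 monthly estimate. Setting `use_batch=True` changes nothing here because no batch ratio is listed for the model.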
Share and export
Scenario, workload inputs, selected models, display currency, and budget are stored in the URL query string, so you can share the exact calculator state by link or export the current estimate as CSV.
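If you want to build shareable links or CSV files yourself, the idea can be sketched with the Python standard library. The parameter names below mirror the request details shown earlier, but the calculator's real query-string keys and CSV columns are not documented here, so treat every name as an assumption.

```python
import csv
import io
from urllib.parse import parse_qs, urlencode


def state_to_query(state: dict) -> str:
    """Flatten calculator state into a query string (hypothetical key names)."""
    flat = {}
    for key, value in state.items():
        if value is None:
            continue  # unset fields, such as an empty budget, are left out
        if isinstance(value, bool):  # check bool before other types
            flat[key] = "true" if value else "false"
        elif isinstance(value, list):
            flat[key] = ",".join(value)
        else:
            flat[key] = str(value)
    return urlencode(flat)


def query_to_state(query: str) -> dict:
    """Read the query string back; all values come back as strings."""
    return {key: values[0] for key, values in parse_qs(query).items()}


def rows_to_csv(rows: list) -> str:
    """Serialize result rows (dicts sharing the same keys) to CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


state = {"modelCodes": ["gemini-2.5-flash-lite"], "inputTokens": 2000,
         "useBatch": False, "monthlyBudget": None}
print(state_to_query(state))
print(rows_to_csv([{"model": "gemini-2.5-flash-lite", "monthly_usd": "169.20"}]))
```

Keeping state in the query string means a shared link carries no server-side session: whoever opens it sees the same inputs.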
Estimated results
Results are sorted by monthly cost, lowest first. Daily, monthly, and yearly totals and the per-request cost breakdown answer the workload cost question before the price-table details.
| Model | Daily cost | Monthly cost | Yearly cost | Delta vs winner | Cost breakdown | 1k requests | Blend / 1M | Savings / month | Budget status | Budget fit |
|---|---|---|---|---|---|---|---|---|---|---|
| Gemini 2.5 Flash-Lite (`gemini-2.5-flash-lite`), Gemini. Updated Mar 30, 02:54. Official source: Gemini Developer. Currency USD. Standard input $0.10 (text / image / video) or $0.30 (audio); standard output $0.40; cache $0.01 (text / image / video) or $0.03 (audio), plus $1.00 / 1,000,000 tokens per hour (storage price). Cache saves $10.80. No batch savings. | $5.64 | $169.20 | $2030.40 | Current winner: lowest monthly cost in this scenario | Input $0.0002; output $0.0004; cached input $0.0000 | $0.564 | $0.175 | $10.80 (cache + batch combined) | Set budget or use a comparable display currency | Set budget to see fit |
Estimate only. Actual billing may differ by tokenizer behavior, cache hit rate, and provider rules.
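To check the derived columns by hand, the short script below recomputes them from the listed prices and the default request. It is a sketch under assumptions: prices are read as USD per 1M tokens, yearly cost is taken as twelve months, and the 3:1 input-to-output weighting for `Blend / 1M` is an inference that happens to match the $0.175 shown, not a formula stated on this page.

```python
# Listed Gemini 2.5 Flash-Lite text prices, read as USD per 1M tokens (assumption).
PRICE_INPUT, PRICE_OUTPUT, PRICE_CACHED = 0.10, 0.40, 0.01

# Default request shape from the request details.
INPUT_TOKENS, OUTPUT_TOKENS = 2000, 1000
DAILY_REQUESTS, ACTIVE_DAYS = 10_000, 30
CACHE_HIT_RATIO = 0.20


def per_request(cache_hit_ratio: float) -> float:
    """Cost of one request when the given share of input tokens is cached."""
    cached = INPUT_TOKENS * cache_hit_ratio
    uncached = INPUT_TOKENS - cached
    return (uncached * PRICE_INPUT + cached * PRICE_CACHED
            + OUTPUT_TOKENS * PRICE_OUTPUT) / 1_000_000


monthly_requests = DAILY_REQUESTS * ACTIVE_DAYS
with_cache = per_request(CACHE_HIT_RATIO)
without_cache = per_request(0.0)

per_1k_requests = with_cache * 1000
cache_savings = (without_cache - with_cache) * monthly_requests
yearly = with_cache * monthly_requests * 12          # assumption: 12 x monthly
blend_3_to_1 = (3 * PRICE_INPUT + PRICE_OUTPUT) / 4  # assumption: 3:1 weighting

print(round(per_1k_requests, 3))  # 0.564
print(round(cache_savings, 2))    # 10.8
print(round(yearly, 2))           # 2030.4
print(round(blend_3_to_1, 3))     # 0.175
```

The per-request breakdown in the table rounds to four decimals, which is why the cached input share of $0.000004 displays as $0.0000.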