How to Use the AI Model Cost Calculator

Why You Need a Cost Calculator

Estimating AI model API costs is rarely as straightforward as it seems. Beyond input and output token counts, factors like cache hits, model selection, and currency differences all affect your final bill. AI Cost Calculator brings these variables into one interface, letting you estimate costs before making a single API call.

Three Steps to Get Your Estimate

1. Choose a Calculation Mode

The homepage offers three scenario entry points:

  • Text Models: For estimating request costs of conversational models like Claude, GPT, and DeepSeek
  • Audio Models: Speech-to-text, TTS, and other audio API costs (coming soon)
  • Image Models: Image generation and editing API costs (coming soon)

Most users will start with Text Models.

2. Fill in Request Parameters

Enter three values in the calculator card:

Parameter     Description                            Example
Cache Miss    Tokens processed for the first time    1,000,000
Cache Hit     Tokens read from cache                 0
Output        Tokens generated by the model          100,000

Each parameter is labeled with its unit (/1M tokens) for easy conversion.

3. Switch Currency

Click the CNY / USD button in the top-right corner to toggle between Chinese Yuan and US Dollars. The calculator defaults to CNY, which is more convenient for developers based in China.

Compare Multiple Models at Once

The calculator supports side-by-side comparison of multiple models. Three default models are shown initially (DeepSeek V4 Pro, Claude Sonnet 4.6, GPT-5.4 Mini). Click + Add Model below the cards to select from the remaining models; if you want to review unit prices first, open the model pricing table or text models page.

Each model card displays:

  • Input price (cache miss / cache hit)
  • Output price
  • Total cost for this request
  • 30-day cost projection based on this request
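
The side-by-side comparison can also be reproduced by hand. The following is a minimal sketch, not the calculator's own code: the model names and prices are made-up placeholders, and `ModelPrice`, `total_cost`, and `compare` are hypothetical helpers introduced only for illustration. Prices are assumed to be quoted per 1M tokens.

```python
from dataclasses import dataclass


@dataclass
class ModelPrice:
    name: str
    input_price: float         # per 1M cache-miss input tokens
    cached_input_price: float  # per 1M cache-hit input tokens
    output_price: float        # per 1M output tokens


# Hypothetical models and prices, for illustration only (not real rates).
MODELS = [
    ModelPrice("model-a", 2.0, 0.2, 8.0),
    ModelPrice("model-b", 3.0, 0.3, 15.0),
    ModelPrice("model-c", 0.5, 0.1, 2.0),
]


def total_cost(m: ModelPrice, miss: int, hit: int, output: int) -> float:
    """Total cost of one request for model m (token counts are raw tokens)."""
    raw = (miss * m.input_price
           + hit * m.cached_input_price
           + output * m.output_price)
    return raw / 1_000_000


def compare(models, miss: int, hit: int, output: int):
    """Return (name, cost) pairs sorted from cheapest to most expensive."""
    rows = [(m.name, total_cost(m, miss, hit, output)) for m in models]
    return sorted(rows, key=lambda row: row[1])


ranking = compare(MODELS, miss=1_000_000, hit=0, output=100_000)
for name, cost in ranking:
    print(f"{name}: {cost:.2f}")
```

Sorting by cost is a choice made here for readability; the calculator itself shows the cards in whatever order you added them.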

Understanding Pricing Formulas

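Different providers use different billing methods. The calculator has built-in formulas for each. Because prices are quoted per 1M tokens, the token counts in the formulas below are expressed in millions of tokens.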

Standard Formula (GPT, DeepSeek, etc.):

Total Cost = (Miss × Input Price) + (Hit × Cached Input Price) + (Output × Output Price)
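
As a rough illustration, the standard formula can be written as a small function. This is a sketch under stated assumptions: `request_cost` is a hypothetical helper, the prices are placeholders rather than any provider's real rates, and prices are taken to be per 1M tokens, so the raw token counts are divided by 1,000,000.

```python
TOKENS_PER_MILLION = 1_000_000


def request_cost(miss_tokens: int, hit_tokens: int, output_tokens: int,
                 input_price: float, cached_input_price: float,
                 output_price: float) -> float:
    """Cost of one request; all prices are per 1M tokens in one currency."""
    raw = (miss_tokens * input_price
           + hit_tokens * cached_input_price
           + output_tokens * output_price)
    return raw / TOKENS_PER_MILLION


# Token counts from the example table; prices are placeholders.
cost = request_cost(
    miss_tokens=1_000_000,
    hit_tokens=0,
    output_tokens=100_000,
    input_price=2.0,
    cached_input_price=0.5,
    output_price=8.0,
)
print(f"{cost:.2f}")  # 2.80
```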

Anthropic Prompt Caching:

Total Cost = (Miss × Base Input Price) + (Hit × Cached Input Price) + (Output × Output Price)

Anthropic also charges a one-time cache creation fee, which is shown separately on the card.
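
One way to keep the cache creation fee separate from the per-request total is sketched below. The article does not say how the calculator derives the fee, so the `cache_write_tokens` and `cache_write_price` parameters are assumptions made for this example, as are the `AnthropicEstimate` and `anthropic_estimate` names and all prices.

```python
from dataclasses import dataclass


@dataclass
class AnthropicEstimate:
    request_cost: float        # result of the formula above
    cache_creation_fee: float  # one-time fee, reported separately


def anthropic_estimate(miss: int, hit: int, output: int,
                       cache_write_tokens: int,
                       base_input_price: float, cached_input_price: float,
                       output_price: float,
                       cache_write_price: float) -> AnthropicEstimate:
    """Split the estimate into the request cost and a one-time cache fee.

    All prices are per 1M tokens. cache_write_tokens is an assumed input:
    the number of tokens written to the cache when it is first created.
    """
    per_million = 1_000_000
    request_cost = (miss * base_input_price
                    + hit * cached_input_price
                    + output * output_price) / per_million
    fee = cache_write_tokens * cache_write_price / per_million
    return AnthropicEstimate(request_cost, fee)


# Placeholder prices, not Anthropic's real rates.
est = anthropic_estimate(
    miss=200_000, hit=800_000, output=100_000, cache_write_tokens=800_000,
    base_input_price=4.0, cached_input_price=0.4,
    output_price=20.0, cache_write_price=5.0,
)
print(f"request: {est.request_cost:.2f}, cache creation: {est.cache_creation_fee:.2f}")
```

Keeping the fee out of the request total mirrors the card layout described above: the fee is paid once, while the request cost repeats with every call.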

DeepSeek Cache Hit/Miss: DeepSeek bills cached and non-cached input at different rates. The gap can be as large as 10x, which makes caching highly valuable for high-frequency workloads.
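
To see what a 10x gap means in practice, the sketch below compares the input-side cost at different cache hit ratios. The prices are placeholders chosen only to have a 10x ratio, not DeepSeek's real rates, and `input_cost` is a hypothetical helper.

```python
def input_cost(input_tokens: int, hit_ratio: float,
               miss_price: float, hit_price: float) -> float:
    """Input-side cost when a fraction hit_ratio of tokens comes from cache.

    Prices are per 1M tokens.
    """
    if not 0.0 <= hit_ratio <= 1.0:
        raise ValueError("hit_ratio must be between 0 and 1")
    hit_tokens = input_tokens * hit_ratio
    miss_tokens = input_tokens - hit_tokens
    return (miss_tokens * miss_price + hit_tokens * hit_price) / 1_000_000


MISS_PRICE = 1.0  # placeholder price per 1M cache-miss tokens
HIT_PRICE = 0.1   # placeholder: 10x cheaper than a cache miss

no_cache = input_cost(1_000_000, 0.0, MISS_PRICE, HIT_PRICE)
mostly_cached = input_cost(1_000_000, 0.8, MISS_PRICE, HIT_PRICE)
savings = 1 - mostly_cached / no_cache
print(f"{no_cache:.2f} -> {mostly_cached:.2f} ({savings:.0%} saved)")
```

With these placeholder numbers, an 80% hit ratio cuts the input cost by roughly 72%. Output tokens are unaffected by caching, so the saving on the whole request is smaller.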

Practical Tips

  1. Fill in cache hits before calculating: If your app has a caching mechanism, don’t forget the Hit field. The savings can be substantial.
  2. Budget in CNY: The default CNY display makes it easy to compare costs against cloud services priced in CNY.
  3. The 30-day projection is just math: It’s a linear extrapolation from a single request, not a real usage forecast.
  4. Report incorrect pricing: There’s a “Report Error” link at the bottom of the page. Submit feedback if you spot outdated pricing, or use the bill checking guide to decide whether the issue is pricing or usage.
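
The third tip is easy to check by hand. The sketch below shows what a linear extrapolation amounts to. The article does not state how many requests per day the calculator assumes, so `requests_per_day` is an assumption of this example, and `project_30_days` is a hypothetical helper.

```python
def project_30_days(cost_per_request: float,
                    requests_per_day: float = 1.0,
                    days: int = 30) -> float:
    """Linear extrapolation: every day looks exactly like the sample request."""
    return cost_per_request * requests_per_day * days


single = 2.8  # placeholder cost of one sample request

light = project_30_days(single)                        # one request per day
heavy = project_30_days(single, requests_per_day=500)  # assumed traffic level
print(f"{light:.2f}")
print(f"{heavy:.2f}")
```

Real traffic varies in request size, hit ratio, and volume from day to day, which is why the projection should be read as arithmetic rather than a forecast.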
