Live model prices

Every major AI model.
One honest table.

Prices update continuously from the providers' own pages plus the OpenRouter live feed. Anything that drifts more than 20% from our hand-checked seed gets a flag — we don't trust automated data more than we trust ourselves.

Loading… · · JSON API

Value picks — what we'd actually use today

Cheap chat / high volume

DeepSeek V4 Flash

$0.14 in · $0.28 out · cache $0.003

Wildly cheap. With cached input it's basically free. Western alternative: Gemini 2.5 Flash-Lite ($0.10 / $0.40).

Daily-driver chat

GPT-4.1 or Claude Sonnet 4.6

$2-3 in · $8-15 out

GPT-4.1 wins on price + 1M context. Sonnet wins on instruction-following and aggressive caching (cache read $0.30).

Reasoning / hard problems

o4-mini

$1.10 in · $4.40 out

Same price as o3-mini, newer generation. For code and math it punches above its price class.

Embeddings

text-embedding-3-small

$0.02 in

The cheapest serious embeddings model. Use 3-large ($0.13) only if your semantic search actually fails at small.

Honest take: prices below are correct, but the "best model" changes monthly. This shortlist updates with the same cron that updates the table.

Provider	Model	Input $/1M	Output $/1M	Cached $/1M	Context	Type	Try
Loading prices…

Affiliate disclosure: "Try" buttons link to each provider's own signup page. When we have an affiliate code we'll mark the button "Try (aff)" and use a sponsored rel attribute — never silently. Right now no link is affiliated. Pricing is identical for you either way.

How we keep this honest

01
Seed: hand-verified, monthly

Every price in this table starts from data/pricing.json in our repo, checked against each provider's official pricing page.
02
Live overlay: OpenRouter, every hour

OpenRouter aggregates pricing from ~300 models with community updates. Our endpoint pulls and merges every 60 minutes.
03
Drift guard: 20% threshold

If the live price for a model differs from our seed by more than 20%, we keep the seed and flag it. Catches OpenRouter glitches and unannounced provider changes.
04
JSON API: free for everyone

Hit GET /api/pricing and you get the same data we use, with a 10-minute CDN cache. Build your own dashboard. Be our guest.

Value picks — what we'd actually use today

DeepSeek V4 Flash

GPT-4.1 or Claude Sonnet 4.6

o4-mini

text-embedding-3-small

How we keep this honest

Seed: hand-verified, monthly

Live overlay: OpenRouter, every hour

Drift guard: 20% threshold

JSON API: free for everyone