Gemini 3.5 Flash Cost Calculator

Google's default production model. $1.50 input / $9 output per 1M tokens with a 1M context window.

Gemini 3.5 Flash shipped at Google I/O 2026 and immediately replaced Gemini 3.1 Pro as the default choice for production developers. At $1.50 input / $9 output per million tokens it is roughly 25% cheaper than 3.1 Pro while running 4× faster and scoring 76.2% on Terminal-Bench 2.1.

Use this calculator to model real-world spend, then compare against Gemini 3.1 Pro (deeper reasoning, $2/$12) and 3.1 Flash-Lite (cheapest at $0.25/$1.50).

Models included

  • Gemini 3.5 Flash (Google) — $1.50 input / $9.00 output per 1M tokens · 1M context window
  • Gemini 3.1 Pro (Google) — $2.00 input / $12.00 output per 1M tokens · 2M context window
  • Gemini 3.1 Flash-Lite (Google) — $0.25 input / $1.50 output per 1M tokens · 1M context window

Frequently asked questions

How much does Gemini 3.5 Flash cost?

Gemini 3.5 Flash costs $1.50 per million input tokens and $9 per million output tokens. Cached input is just $0.15 per 1M — a 90% discount.

Is Gemini 3.5 Flash cheaper than Gemini 3.1 Pro?

Yes — Flash is $1.50/$9 vs Pro at $2/$12 (and $4/$18 above 200K context). For most workloads Flash is 25% cheaper, while delivering better Terminal-Bench scores.

What is the Gemini 3.5 Flash context window?

Gemini 3.5 Flash supports a 1-million-token context window at flat pricing — no long-context surcharge.

How does Gemini 3.5 Flash compare to Claude Sonnet 4.6?

Flash is ~50% cheaper ($1.50/$9 vs $3/$15) and ships responses faster. Sonnet 4.6 typically scores slightly higher on agentic coding benchmarks but the gap has narrowed dramatically with 3.5 Flash.

Related calculators

  • Gemini 3.1 Pro Cost Calculator
  • Gemini vs Claude vs GPT cost
  • Cheapest AI models 2026