Google's default production model. $1.50 input / $9 output per 1M tokens with a 1M context window.
Gemini 3.5 Flash shipped at Google I/O 2026 and immediately replaced Gemini 3.1 Pro as the default choice for production developers. At $1.50 input / $9 output per million tokens, it is 25% cheaper than 3.1 Pro at standard context lengths, runs 4× faster, and scores 76.2% on Terminal-Bench 2.1.
Use this calculator to model real-world spend, then compare against Gemini 3.1 Pro (deeper reasoning, $2/$12) and 3.1 Flash-Lite (cheapest at $0.25/$1.50).
Gemini 3.5 Flash costs $1.50 per million input tokens and $9 per million output tokens. Cached input costs $0.15 per million tokens, a 90% discount on the standard input rate.
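As a rough sketch of how these rates turn into a bill, the Python function below estimates the cost of a single request from its token counts. The helper name `flash_cost` is hypothetical, and the billing model is an assumption: cached tokens are treated as a subset of the input tokens and charged at the cached rate instead of the standard input rate.

```python
# Rates quoted above, in USD per 1M tokens.
INPUT_RATE = 1.50
CACHED_INPUT_RATE = 0.15
OUTPUT_RATE = 9.00


def flash_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the part of input_tokens served from cache,
    billed at the cached rate instead of the standard input rate.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_RATE
        + cached_tokens * CACHED_INPUT_RATE
        + output_tokens * OUTPUT_RATE
    )
    return total / 1_000_000
```

Under these assumptions, a request with 100,000 input tokens (80,000 of them cached) and 10,000 output tokens comes to about $0.132.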
Yes. Gemini 3.5 Flash costs $1.50/$9 versus $2/$12 for Gemini 3.1 Pro, and Pro rises to $4/$18 above 200K context. At standard context lengths Flash is 25% cheaper, and it also posts better Terminal-Bench scores.
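To see how the Pro long-context tier changes the comparison, here is a minimal sketch that computes Flash's savings relative to Pro for a given request. The function names are hypothetical, and it assumes the $4/$18 Pro rate applies to the entire request once the input exceeds 200K tokens; check the official pricing page for the exact tier rule.

```python
# (input rate, output rate) in USD per 1M tokens, as quoted above.
FLASH = (1.50, 9.00)          # flat pricing
PRO_STANDARD = (2.00, 12.00)
PRO_LONG = (4.00, 18.00)      # above 200K context
LONG_CONTEXT_THRESHOLD = 200_000


def request_cost(rates: tuple[float, float], input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at the given (input, output) rates."""
    in_rate, out_rate = rates
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


def pro_cost(input_tokens: int, output_tokens: int) -> float:
    # Assumption: the long-context rate covers the whole request
    # once the input exceeds the threshold.
    rates = PRO_LONG if input_tokens > LONG_CONTEXT_THRESHOLD else PRO_STANDARD
    return request_cost(rates, input_tokens, output_tokens)


def flash_savings(input_tokens: int, output_tokens: int) -> float:
    """Fraction saved by using Flash instead of Pro (0.25 means 25%)."""
    pro = pro_cost(input_tokens, output_tokens)
    if pro == 0:
        raise ValueError("request must contain at least one token")
    flash = request_cost(FLASH, input_tokens, output_tokens)
    return 1 - flash / pro
```

With these assumptions, a 100,000-token prompt with 10,000 output tokens saves 25%, while a 500,000-token prompt with the same output saves roughly 61%, because only Pro's rates increase above 200K.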
Gemini 3.5 Flash supports a 1-million-token context window at flat pricing, with no long-context surcharge.
Compared with Sonnet 4.6 ($3/$15), Flash ($1.50/$9) is 50% cheaper on input and 40% cheaper on output, and it returns responses faster. Sonnet 4.6 typically scores slightly higher on agentic coding benchmarks, but the gap has narrowed considerably with 3.5 Flash.
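Because the input and output discounts differ, the overall saving against Sonnet 4.6 depends on the token mix of the workload. The sketch below, using a hypothetical helper name and only the list prices quoted above (no caching or batch discounts), computes the blended saving for a given mix.

```python
def savings_vs_sonnet(input_tokens: int, output_tokens: int) -> float:
    """Fraction saved by Flash ($1.50/$9) versus Sonnet 4.6 ($3/$15).

    Hypothetical helper; uses list prices per 1M tokens only.
    """
    flash = input_tokens * 1.50 + output_tokens * 9.00
    sonnet = input_tokens * 3.00 + output_tokens * 15.00
    if sonnet == 0:
        raise ValueError("request must contain at least one token")
    # The per-1M scaling cancels out in the ratio.
    return 1 - flash / sonnet
```

An input-only workload saves 50%, an output-only workload saves 40%, and a mix of ten input tokens per output token lands at about 47%.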