The production-default Claude model. $3 input / $15 output per 1M tokens with a 1M context window.
Claude Sonnet 4.6 hits the sweet spot for most production deployments: coding agents, customer chat, JSON extraction, and RAG. At $3 / $15 per million tokens it costs 40% less than Opus 4.7 while matching it on most non-reasoning benchmarks.
Combined with prompt caching (cached reads at $0.30/M) and batch pricing (50% off), Sonnet 4.6 is the default Claude model for teams scaling beyond a proof-of-concept.
Claude Sonnet 4.6 costs $3 per million input tokens and $15 per million output tokens. Cached input reads are $0.30 per 1M.
Yes. Sonnet 4.6 is $3/$15 per 1M vs GPT-5.5 at $5/$30 — roughly 40% cheaper across the board. Quality is comparable on most coding and chat benchmarks.
Claude Sonnet 4.6 supports a 1-million-token context window at standard pricing.
Move to Opus only when you measurably need its reasoning advantage — long multi-step research, deep code refactors, or hard math/logic. For 90% of production work, Sonnet ships the same answers at 40% less.