How Much Does Claude Sonnet Cost | API and Pro/Max Monthly Plans Explained
Claude Sonnet 4.6 costs $3 input / $15 output per million tokens (USD, ex-tax) via API, and starts at $20/month for individual users on the Pro subscription plan (equivalent to $17/month on an annual contract). This article consolidates all the pricing information you need when using Sonnet — API rates, subscription plans, Batch / prompt caching discounts, the 1M token context window, and comparisons with other models — all based on Anthropic's official pricing page.
Claude Sonnet 4.6 API pricing is $3 input / $15 output per million tokens (USD, ex-tax) — 40% cheaper than Opus 4.7 ($5 input / $25 output) while offering more capability than Haiku 4.5 ($1 input / $5 output). The context window is 1M tokens with no additional high-context surcharge beyond 200K, which is a significant advantage in the current lineup.
For subscription users, individuals need at least Pro ($20/month, or $17/month annually) — this is the entry tier that also unlocks Claude Code, Cowork, unlimited Projects, and Research. Max starts at $100/month for heavy users, Team is $20–$25/seat/month for groups of 5 or more, and Enterprise (custom quote) is available for compliance-heavy organizations.
Batch API cuts Sonnet 4.6 to $1.5 input / $7.5 output — a 50% discount — while prompt caching cache reads drop to just 10% ($0.30/MTok). Since Batch and caching discounts stack, large-scale batches and repeated references to long documents can bring costs down to practical levels, making it easier to use Sonnet continuously while keeping expenses under control.
目次 (12)
- Quick Summary — Sonnet 4.6 Costs $3/$15 via API, from $20/month on Pro
- Claude Sonnet 4.6 API Pricing — $3 Input / $15 Output and Caching Discounts
- Using Sonnet on a Subscription — Pro $20, Max $100, Team $25/seat
- Sonnet 4.6 vs. Opus 4.7 vs. Haiku 4.5 — Price Comparison
- No Extra Charge for the 1M Context Window — Real Cost Examples for Long Documents
- 50% Off with Batch API — Run Sonnet 4.6 at Half Price
- 3 Ways to Reduce Sonnet Costs
- 1. Always Enable Prompt Caching
- 2. Route Non-Urgent Processing to Batch API
- 3. Route Tasks to the Right Model
- Summary — Sonnet 4.6 Pricing at a Glance
- Sources and Reference Links
Quick Summary — Sonnet 4.6 Costs $3/$15 via API, from $20/month on Pro
There are two main ways to use Claude Sonnet 4.6, each with a different pricing model: pay-as-you-go via API, or subscribe to a Claude plan (Pro / Max / Team / Enterprise) and use it through the chat UI. Key points for this section are summarized below.
| Method | Price (USD, ex-tax) | Billing Unit | Primary Use Case |
|---|---|---|---|
| Claude API | $3 input / $15 output (per million tokens) | Usage-based | App integration, automation |
| Pro | $20/month ($17/month annually) | Monthly | Individual professional use |
| Max | From $100/month | Monthly | Heavy individual users |
| Team (Standard seat) | $25/seat/month ($20 annually) | Monthly | Teams of 5 or more |
| Team (Premium) | $125/seat/month ($100 annually) | Monthly | Heavy-use teams |
| Enterprise | Custom quote | Annual contract | Large enterprises, regulated industries |
Source: Anthropic Pricing / Claude Docs Pricing (accessed: 2026-05-17)
The simple decision rule is: use the API if you're calling Sonnet from your own code, and a subscription if you're chatting with Sonnet through the browser or official apps. The API is easy to get started with since you only pay for what you use, but costs can grow quickly if you underestimate token usage. Subscriptions offer predictable monthly billing, though Pro and Max differ in the amount of compute available (i.e., access to Sonnet and other high-performance models).
Claude Sonnet 4.6 API Pricing — $3 Input / $15 Output and Caching Discounts
When using Claude Sonnet 4.6 via API, pricing is based on the number of tokens processed per million tokens (MTok) for both input and output. Key points for this section are summarized below.
| Item | Sonnet 4.6 Price | Multiplier vs. Standard |
|---|---|---|
| Input tokens (Base) | $3 / MTok | 1.0x |
| Output tokens | $15 / MTok | 1.0x |
| 5-minute cache write | $3.75 / MTok | 1.25x |
| 1-hour cache write | $6 / MTok | 2.0x |
| Cache read (hit) | $0.30 / MTok | 0.1x |
Source: Claude Docs — Pricing (accessed: 2026-05-17)
The impact of prompt caching is particularly noteworthy. For apps that repeatedly send long system prompts or large reference documents, enabling caching means the second request onward reads the same content at just 10% of the standard rate (i.e., $0.30/MTok for Sonnet). A single cache hit with the 5-minute cache is enough to recover the write cost, and even the 1-hour cache pays for itself after two hits.
If you're new to the concept of tokens: in English, roughly "1 token ≈ 4 characters ≈ 0.75 words"; in Japanese, approximately "1 character ≈ 1–1.5 tokens." For example, sending a 10,000-character Japanese article as input would be roughly 10,000–15,000 tokens, costing around $0.03–$0.045.
Using Sonnet on a Subscription — Pro $20, Max $100, Team $25/seat
If you're using Sonnet through the Claude.ai chat UI (browser, desktop app, or mobile app), you'll need a monthly subscription plan. Key points for this section are summarized below.
| Plan | Monthly (USD, ex-tax) | Annual (monthly equivalent) | Sonnet Access | Claude Code |
|---|---|---|---|---|
| Free | $0 | $0 | Limited | No |
| Pro | $20/month | ~$17/month | Yes | Yes |
| Max 5x | From $100/month | — | Yes (high frequency) | Yes |
| Max 20x | From $100/month* | — | Yes (very high frequency) | Yes |
| Team Standard | $25/seat/month | $20/seat/month | Yes | Yes |
| Team Premium | $125/seat/month | $100/seat/month | Yes (heavy) | Yes |
| Enterprise | Custom quote | Custom quote | Yes | Yes |
Source: Anthropic Pricing (accessed: 2026-05-17)
*Max 5x and Max 20x are both listed as "From $100/month" on the official page, but are priced differently. Always check the official page for the exact monthly rate.
Here are guidelines for choosing the right plan:
- If you just want to try Sonnet occasionally, Free is sufficient (though usage limits are strict)
- For daily professional use, Pro ($20/month) is the minimum — Claude Code, unlimited Projects, and Research are all unlocked starting at Pro
- If you spend hours each day conversing with Sonnet or running Claude Code for extended periods, consider Max
- Organizations with 5+ users who want to move away from a shared account should use Team (includes SSO and audit logs)
- Industries requiring HIPAA, ZDR (Zero Data Retention), SCIM (automated identity management), or IP restrictions should use Enterprise
Sonnet 4.6 vs. Opus 4.7 vs. Haiku 4.5 — Price Comparison
Seeing Sonnet's price in context with the higher-tier Opus and lower-tier Haiku makes it easier to assess its value. Key points for this section are summarized below.
| Model | Input $/MTok | Output $/MTok | Context Window | Extended Thinking | Adaptive Thinking |
|---|---|---|---|---|---|
| Claude Opus 4.7 | $5 | $25 | 1M tokens | No | Yes |
| Claude Sonnet 4.6 | $3 | $15 | 1M tokens | Yes | Yes |
| Claude Haiku 4.5 | $1 | $5 | 200K tokens | Yes | No |
Source: Models overview (accessed: 2026-05-17)
This table reflects information as of May 2026. Opus 4.8 has since been released, and the current top model has changed. For the latest specs and pricing, please check Models overview and claude.com/pricing.
Glossary (comparison table): Extended Thinking = a reasoning mode that outputs the thinking process explicitly / Adaptive Thinking = a feature that automatically adjusts the depth of reasoning based on the difficulty of the problem
Sonnet 4.6 is priced at 60% of Opus 4.7 (the top model at time of writing; Opus 4.8 is now the latest flagship), while sharing the same 1M token context window and supporting both Extended Thinking and Adaptive Thinking. Anthropic's official guidance positions the models as follows: Opus for pushing the frontier of complex agentic coding, Sonnet for production workloads balancing speed and intelligence, and Haiku for simple classification, summarization, and short conversations.
Previous Sonnet models (claude-sonnet-4-5 / claude-sonnet-4 / claude-3-5-sonnet) also use the same $3 input / $15 output pricing structure, meaning the Sonnet brand's price has remained consistent across generations. For more on the positioning of older models, see the related article "What Is Claude Sonnet 3.5 | Legacy Model Performance and Differences from Sonnet 4.6."
No Extra Charge for the 1M Context Window — Real Cost Examples for Long Documents
Claude Sonnet 4.6 supports a context window of up to 1M tokens (approximately 750,000 English words) at standard per-token rates — no surcharge even beyond 200K tokens. Both Opus 4.7 and Sonnet 4.6 are billed at the same per-token rate regardless of whether a request uses 900K tokens or 9K tokens, with no "long-context premium" seen in some other frontier LLMs.
Source: Long context pricing — Claude Docs (accessed: 2026-05-17)
Here is a sample cost calculation for long-document processing. If you send an entire 500,000-token input (equivalent to one full-length English novel) to Sonnet 4.6 and request a 50,000-token analysis report as output, the cost breaks down as follows:
- Input: 500,000 tokens × $3 / MTok = $1.50
- Output: 50,000 tokens × $15 / MTok = $0.75
- Total: $2.25 (approximately ¥350 at $1 = ¥155)
With prompt caching enabled, if you query the same book multiple times within an hour, the input cost for the second request onward drops to $0.15 (0.30/MTok × 0.1 × 500K = $0.15). This structure makes Sonnet's pricing highly practical for analysis pipelines that repeatedly reference long documents.
50% Off with Batch API — Run Sonnet 4.6 at Half Price
For bulk processing where turnaround time is flexible, Batch API cuts Sonnet 4.6 pricing in half. Key points for this section are summarized below.
| Model | Batch Input $/MTok | Batch Output $/MTok | vs. Standard Pricing |
|---|---|---|---|
| Claude Opus 4.7 | $2.50 | $12.50 | 50% off |
| Claude Sonnet 4.6 | $1.50 | $7.50 | 50% off |
| Claude Haiku 4.5 | $0.50 | $2.50 | 50% off |
Source: Batch processing pricing — Claude Docs (accessed: 2026-05-17)
Batch API is designed for asynchronous processing with up to 24-hour turnaround, making it well-suited for scenarios where real-time responses aren't required — overnight batch jobs, large-scale document analysis, dataset generation, content rewriting, and similar tasks. The Batch discount can be stacked with the prompt caching discount, so combining "Batch + cache read" can compress input costs down to $0.15/MTok.
Additionally, the 64K token max output limit is relaxed in Batch API — by adding the output-300k-2026-03-24 beta header, Sonnet 4.6 can generate outputs up to 300K tokens long, making it well worth leveraging Sonnet for long-form report generation.
3 Ways to Reduce Sonnet Costs
Here are concrete strategies for getting the most out of Sonnet 4.6 while keeping costs under control.
1. Always Enable Prompt Caching
If you're repeatedly sending the same system prompt or reference material, simply adding cache_control drops the input rate to 10% of standard ($0.30/MTok for Sonnet). This is highly effective for stock-type use cases like chatbots, code review bots, and FAQ responders — the initial write cost (1.25x) is quickly recovered after just a few hits.
2. Route Non-Urgent Processing to Batch API
Any job that can tolerate up to 24-hour turnaround — overnight reports, dataset generation, reprocessing existing content — should be sent to Batch API, which halves both input and output costs. Stacking Batch with caching can bring Sonnet 4.6's effective cost down to a fraction of the standard rate.
3. Route Tasks to the Right Model
There's no need to run everything through Sonnet. Simple classification, short summaries, and known FAQ responses can be handled adequately by Haiku 4.5 ($1 input / $5 output) at one-third the price of Sonnet. Conversely, routing only long-running agentic coding tasks or complex research to Opus 4.7 allows you to control overall spend while maintaining quality where it matters.
Summary — Sonnet 4.6 Pricing at a Glance
In one line: Claude Sonnet 4.6 costs $3 input / $15 output per million tokens via API, and starts at Pro $20/month for subscriptions. It's 40% cheaper than Opus 4.7 and handles up to 1M tokens of long context with no additional surcharge — which is why Sonnet is easy to position as the workhorse for production workloads.
For subscription users chatting through the UI, Pro ($20/month, equivalent to $17/month annually) is the individual minimum. For API users, combining prompt caching and Batch API can significantly reduce effective costs. Since pricing is subject to change, always check the latest rates at claude.com/pricing and Claude Docs Pricing before committing to a purchase decision.
Sources and Reference Links
- Anthropic Pricing (official pricing page) — Monthly rates for Pro / Max / Team / Enterprise
- Claude Docs — Pricing — Detailed API, Batch, prompt caching, and 1M context pricing
- Models overview — Spec comparison for Sonnet 4.6 / Opus 4.7 / Haiku 4.5
- Related article: Complete Claude Pricing Comparison | How to Choose Free, Pro, or Max
- Related article: What Is Claude Sonnet 3.5 | Legacy Model Performance and Differences from Sonnet 4.6