How Much Does Claude Sonnet Cost | API and Pro/Max Monthly Plans Explained

Clauder Navi 編集部 / 最終更新 2026-06-08

Claude Sonnet 4.6 costs $3 input / $15 output per million tokens (USD, ex-tax) via API, and starts at $20/month for individual users on the Pro subscription plan (equivalent to $17/month on an annual contract). This article consolidates all the pricing information you need when using Sonnet — API rates, subscription plans, Batch / prompt caching discounts, the 1M token context window, and comparisons with other models — all based on Anthropic's official pricing page.

AI-powered article summarypowered by Claude

結論powered by Claude

Claude Sonnet 4.6 API pricing is $3 input / $15 output per million tokens (USD, ex-tax) — 40% cheaper than Opus 4.7 ($5 input / $25 output) while offering more capability than Haiku 4.5 ($1 input / $5 output). The context window is 1M tokens with no additional high-context surcharge beyond 200K, which is a significant advantage in the current lineup.

For subscription users, individuals need at least Pro ($20/month, or $17/month annually) — this is the entry tier that also unlocks Claude Code, Cowork, unlimited Projects, and Research. Max starts at $100/month for heavy users, Team is $20–$25/seat/month for groups of 5 or more, and Enterprise (custom quote) is available for compliance-heavy organizations.

Batch API cuts Sonnet 4.6 to $1.5 input / $7.5 output — a 50% discount — while prompt caching cache reads drop to just 10% ($0.30/MTok). Since Batch and caching discounts stack, large-scale batches and repeated references to long documents can bring costs down to practical levels, making it easier to use Sonnet continuously while keeping expenses under control.

目次 (12)

Quick Summary — Sonnet 4.6 Costs $3/$15 via API, from $20/month on Pro
Claude Sonnet 4.6 API Pricing — $3 Input / $15 Output and Caching Discounts
Using Sonnet on a Subscription — Pro $20, Max $100, Team $25/seat
Sonnet 4.6 vs. Opus 4.7 vs. Haiku 4.5 — Price Comparison
No Extra Charge for the 1M Context Window — Real Cost Examples for Long Documents
50% Off with Batch API — Run Sonnet 4.6 at Half Price
3 Ways to Reduce Sonnet Costs
1. Always Enable Prompt Caching
2. Route Non-Urgent Processing to Batch API
3. Route Tasks to the Right Model
Summary — Sonnet 4.6 Pricing at a Glance
Sources and Reference Links

Quick Summary — Sonnet 4.6 Costs $3/$15 via API, from $20/month on Pro

There are two main ways to use Claude Sonnet 4.6, each with a different pricing model: pay-as-you-go via API, or subscribe to a Claude plan (Pro / Max / Team / Enterprise) and use it through the chat UI. Key points for this section are summarized below.

Method	Price (USD, ex-tax)	Billing Unit	Primary Use Case
Claude API	$3 input / $15 output (per million tokens)	Usage-based	App integration, automation
Pro	$20/month ($17/month annually)	Monthly	Individual professional use
Max	From $100/month	Monthly	Heavy individual users
Team (Standard seat)	$25/seat/month ($20 annually)	Monthly	Teams of 5 or more
Team (Premium)	$125/seat/month ($100 annually)	Monthly	Heavy-use teams
Enterprise	Custom quote	Annual contract	Large enterprises, regulated industries

Source: Anthropic Pricing / Claude Docs Pricing (accessed: 2026-05-17)

The simple decision rule is: use the API if you're calling Sonnet from your own code, and a subscription if you're chatting with Sonnet through the browser or official apps. The API is easy to get started with since you only pay for what you use, but costs can grow quickly if you underestimate token usage. Subscriptions offer predictable monthly billing, though Pro and Max differ in the amount of compute available (i.e., access to Sonnet and other high-performance models).

Claude Sonnet 4.6 API Pricing — $3 Input / $15 Output and Caching Discounts

When using Claude Sonnet 4.6 via API, pricing is based on the number of tokens processed per million tokens (MTok) for both input and output. Key points for this section are summarized below.

Item	Sonnet 4.6 Price	Multiplier vs. Standard
Input tokens (Base)	$3 / MTok	1.0x
Output tokens	$15 / MTok	1.0x
5-minute cache write	$3.75 / MTok	1.25x
1-hour cache write	$6 / MTok	2.0x
Cache read (hit)	$0.30 / MTok	0.1x

Source: Claude Docs — Pricing (accessed: 2026-05-17)

The impact of prompt caching is particularly noteworthy. For apps that repeatedly send long system prompts or large reference documents, enabling caching means the second request onward reads the same content at just 10% of the standard rate (i.e., $0.30/MTok for Sonnet). A single cache hit with the 5-minute cache is enough to recover the write cost, and even the 1-hour cache pays for itself after two hits.

If you're new to the concept of tokens: in English, roughly "1 token ≈ 4 characters ≈ 0.75 words"; in Japanese, approximately "1 character ≈ 1–1.5 tokens." For example, sending a 10,000-character Japanese article as input would be roughly 10,000–15,000 tokens, costing around $0.03–$0.045.

Using Sonnet on a Subscription — Pro $20, Max $100, Team $25/seat

If you're using Sonnet through the Claude.ai chat UI (browser, desktop app, or mobile app), you'll need a monthly subscription plan. Key points for this section are summarized below.

Plan	Monthly (USD, ex-tax)	Annual (monthly equivalent)	Sonnet Access	Claude Code
Free	$0	$0	Limited	No
Pro	$20/month	~$17/month	Yes	Yes
Max 5x	From $100/month	—	Yes (high frequency)	Yes
Max 20x	From $100/month*	—	Yes (very high frequency)	Yes
Team Standard	$25/seat/month	$20/seat/month	Yes	Yes
Team Premium	$125/seat/month	$100/seat/month	Yes (heavy)	Yes
Enterprise	Custom quote	Custom quote	Yes	Yes

Source: Anthropic Pricing (accessed: 2026-05-17)

*Max 5x and Max 20x are both listed as "From $100/month" on the official page, but are priced differently. Always check the official page for the exact monthly rate.

Here are guidelines for choosing the right plan:

If you just want to try Sonnet occasionally, Free is sufficient (though usage limits are strict)
For daily professional use, Pro ($20/month) is the minimum — Claude Code, unlimited Projects, and Research are all unlocked starting at Pro
If you spend hours each day conversing with Sonnet or running Claude Code for extended periods, consider Max
Organizations with 5+ users who want to move away from a shared account should use Team (includes SSO and audit logs)
Industries requiring HIPAA, ZDR (Zero Data Retention), SCIM (automated identity management), or IP restrictions should use Enterprise

Sonnet 4.6 vs. Opus 4.7 vs. Haiku 4.5 — Price Comparison

Seeing Sonnet's price in context with the higher-tier Opus and lower-tier Haiku makes it easier to assess its value. Key points for this section are summarized below.

Model	Input $/MTok	Output $/MTok	Context Window	Extended Thinking	Adaptive Thinking
Claude Opus 4.7	$5	$25	1M tokens	No	Yes
Claude Sonnet 4.6	$3	$15	1M tokens	Yes	Yes
Claude Haiku 4.5	$1	$5	200K tokens	Yes	No

Source: Models overview (accessed: 2026-05-17)

This table reflects information as of May 2026. Opus 4.8 has since been released, and the current top model has changed. For the latest specs and pricing, please check Models overview and claude.com/pricing.

Glossary (comparison table): Extended Thinking = a reasoning mode that outputs the thinking process explicitly / Adaptive Thinking = a feature that automatically adjusts the depth of reasoning based on the difficulty of the problem

Sonnet 4.6 is priced at 60% of Opus 4.7 (the top model at time of writing; Opus 4.8 is now the latest flagship), while sharing the same 1M token context window and supporting both Extended Thinking and Adaptive Thinking. Anthropic's official guidance positions the models as follows: Opus for pushing the frontier of complex agentic coding, Sonnet for production workloads balancing speed and intelligence, and Haiku for simple classification, summarization, and short conversations.

Previous Sonnet models (claude-sonnet-4-5 / claude-sonnet-4 / claude-3-5-sonnet) also use the same $3 input / $15 output pricing structure, meaning the Sonnet brand's price has remained consistent across generations. For more on the positioning of older models, see the related article "What Is Claude Sonnet 3.5 | Legacy Model Performance and Differences from Sonnet 4.6."

No Extra Charge for the 1M Context Window — Real Cost Examples for Long Documents

Claude Sonnet 4.6 supports a context window of up to 1M tokens (approximately 750,000 English words) at standard per-token rates — no surcharge even beyond 200K tokens. Both Opus 4.7 and Sonnet 4.6 are billed at the same per-token rate regardless of whether a request uses 900K tokens or 9K tokens, with no "long-context premium" seen in some other frontier LLMs.

Source: Long context pricing — Claude Docs (accessed: 2026-05-17)

Here is a sample cost calculation for long-document processing. If you send an entire 500,000-token input (equivalent to one full-length English novel) to Sonnet 4.6 and request a 50,000-token analysis report as output, the cost breaks down as follows:

Input: 500,000 tokens × $3 / MTok = $1.50
Output: 50,000 tokens × $15 / MTok = $0.75
Total: $2.25 (approximately ¥350 at $1 = ¥155)

With prompt caching enabled, if you query the same book multiple times within an hour, the input cost for the second request onward drops to $0.15 (0.30/MTok × 0.1 × 500K = $0.15). This structure makes Sonnet's pricing highly practical for analysis pipelines that repeatedly reference long documents.

50% Off with Batch API — Run Sonnet 4.6 at Half Price

For bulk processing where turnaround time is flexible, Batch API cuts Sonnet 4.6 pricing in half. Key points for this section are summarized below.

Model	Batch Input $/MTok	Batch Output $/MTok	vs. Standard Pricing
Claude Opus 4.7	$2.50	$12.50	50% off
Claude Sonnet 4.6	$1.50	$7.50	50% off
Claude Haiku 4.5	$0.50	$2.50	50% off

Source: Batch processing pricing — Claude Docs (accessed: 2026-05-17)

Batch API is designed for asynchronous processing with up to 24-hour turnaround, making it well-suited for scenarios where real-time responses aren't required — overnight batch jobs, large-scale document analysis, dataset generation, content rewriting, and similar tasks. The Batch discount can be stacked with the prompt caching discount, so combining "Batch + cache read" can compress input costs down to $0.15/MTok.

Additionally, the 64K token max output limit is relaxed in Batch API — by adding the output-300k-2026-03-24 beta header, Sonnet 4.6 can generate outputs up to 300K tokens long, making it well worth leveraging Sonnet for long-form report generation.

3 Ways to Reduce Sonnet Costs

Here are concrete strategies for getting the most out of Sonnet 4.6 while keeping costs under control.

1. Always Enable Prompt Caching

If you're repeatedly sending the same system prompt or reference material, simply adding cache_control drops the input rate to 10% of standard ($0.30/MTok for Sonnet). This is highly effective for stock-type use cases like chatbots, code review bots, and FAQ responders — the initial write cost (1.25x) is quickly recovered after just a few hits.

2. Route Non-Urgent Processing to Batch API

Any job that can tolerate up to 24-hour turnaround — overnight reports, dataset generation, reprocessing existing content — should be sent to Batch API, which halves both input and output costs. Stacking Batch with caching can bring Sonnet 4.6's effective cost down to a fraction of the standard rate.

3. Route Tasks to the Right Model

There's no need to run everything through Sonnet. Simple classification, short summaries, and known FAQ responses can be handled adequately by Haiku 4.5 ($1 input / $5 output) at one-third the price of Sonnet. Conversely, routing only long-running agentic coding tasks or complex research to Opus 4.7 allows you to control overall spend while maintaining quality where it matters.

Summary — Sonnet 4.6 Pricing at a Glance

In one line: Claude Sonnet 4.6 costs $3 input / $15 output per million tokens via API, and starts at Pro $20/month for subscriptions. It's 40% cheaper than Opus 4.7 and handles up to 1M tokens of long context with no additional surcharge — which is why Sonnet is easy to position as the workhorse for production workloads.

For subscription users chatting through the UI, Pro ($20/month, equivalent to $17/month annually) is the individual minimum. For API users, combining prompt caching and Batch API can significantly reduce effective costs. Since pricing is subject to change, always check the latest rates at claude.com/pricing and Claude Docs Pricing before committing to a purchase decision.

Sources and Reference Links

Anthropic Pricing (official pricing page) — Monthly rates for Pro / Max / Team / Enterprise
Claude Docs — Pricing — Detailed API, Batch, prompt caching, and 1M context pricing
Models overview — Spec comparison for Sonnet 4.6 / Opus 4.7 / Haiku 4.5
Related article: Complete Claude Pricing Comparison | How to Choose Free, Pro, or Max
Related article: What Is Claude Sonnet 3.5 | Legacy Model Performance and Differences from Sonnet 4.6

参考になったら ♡

この記事は役立ちましたか?

ご注意: Clauder Navi は Anthropic 公式情報を直接参照し正確な内容に努めておりますが、本記事の内容に基づく投資判断・契約・利用結果による損害について責任を負いかねます。重要な意思決定の際は、必ず Anthropic 公式・ claude.com の一次情報をご自身でご確認ください。

Clauder Navi 編集部

@clauder_navi

Anthropic の Claude / Claude Code を中心に、日本のエンジニア向けに最新動向と実務を毎日発信。運営方針はメディアについてをご覧ください。

プロフィール → 副社長コラム → レッスン一覧 →