Claude Code Limits Doubled | How SpaceX Colossus 1 Changes the Game

Clauder Navi 編集部 / 最終更新 2026-05-08

For paid plan users who've been frustrated by Claude Code's 5-hour window stopping work mid-task, here's a breakdown of the change that took effect on May 6, 2026: limits were instantly doubled across all plans. We also cover the scale of the SpaceX Colossus 1 contract behind this, and what the Opus API bandwidth expansion means in practice for engineers who use the API heavily — from large-scale code analysis to continuous agent execution.

AI-generated article summarypowered by Claude

結論powered by Claude

Starting May 6, 2026, Claude Code's 5-hour window limits were instantly doubled across all paid plans, and the Max plan had its peak-hours throttling removed entirely. No settings changes or plan upgrades are required — existing users can simply keep working under the new, higher limits during daytime hours.

The bigger practical story is the +1500% input and +900% output bandwidth expansion for the Opus API. This expands the room to use Opus without downgrading to Sonnet for large-scale code analysis, long-form document generation, and continuous agent execution — directly boosting throughput for engineers who use the API at scale.

This expansion is backed by the SpaceX Colossus 1 contract: over 220,000 GPUs and 300+ MW of compute capacity, which Anthropic plans to integrate into its own infrastructure during May 2026. Note that orbital AI compute capacity remains at the concept/expression-of-interest stage and is not a confirmed plan.

目次 (12)

The Day the "5-Hour Wall" Disappeared — Full Details on the Rate Limit Doubling and Same-Day Rollout
What Is Colossus 1? — The Scale and Significance of Anthropic's Infrastructure Deal with SpaceX
What +1500% Opus API Bandwidth Means in Practice — What Changes for Heavy API Users
Three New Managed Agents Features (Dreaming / Outcomes / Multi-Agent) — Another Major Announcement on the Same Day
Dreaming — How Agents Self-Improve Through "Reflection" (Research Preview)
Outcomes — Define "Success" to Improve Task Completion Rates (Public Beta)
Multi-Agent Orchestration — Lead and Specialist Role Division (Public Beta)
What to Check Right Away — How to Verify Your New Limits and Rethink Long-Running Task Design
Check Your Current Rate Status with the /usage Command
Free Your Long-Running Task Design from the "Must Finish in 5 Hours" Constraint
Use the Max Plan's Removed Peak Throttling to Your Advantage in Scheduling
Sources

The Day the "5-Hour Wall" Disappeared — Full Details on the Rate Limit Doubling and Same-Day Rollout

Claude Code had previously enforced a token consumption cap within a fixed time window (commonly called the "5-hour window"). Many engineers have experienced the model stopping mid-task during long code analysis sessions or large-scale refactoring jobs.

According to Anthropic's official announcement, this change doubles the rate limits within the 5-hour window across all paid plans. Here's the breakdown by plan:

Plan	Before	After
Pro	Limit within 5-hour window	2x within same window
Max	Throttling during peak hours	Peak throttling removed entirely
Team	Limit within 5-hour window	2x within same window
Enterprise (seat-based)	Limit within 5-hour window	2x within same window

For Max plan users in particular, peak-hour throttling has been eliminated entirely. You can now work at the same throughput during Japanese daytime hours as you could at night. For long-running agent tasks, this significantly reduces the risk of hitting a limit and having execution stop partway through.

No settings changes or plan changes are needed. Users already on a paid plan will automatically benefit from the doubled limits as of May 6, 2026. Anthropic expanded backend capacity to make this instant rollout possible — and the SpaceX compute deal is what made it happen.

What Is Colossus 1? — The Scale and Significance of Anthropic's Infrastructure Deal with SpaceX

The contract that enabled this limit increase is an agreement between Anthropic and SpaceX. SpaceX's Colossus 1 is a large-scale GPU cluster built in Texas, and Anthropic has contracted for its entire compute capacity (Anthropic official announcement).

The scale is stated in concrete figures. The compute capacity secured exceeds 300 megawatts, with over 220,000 GPUs including NVIDIA H100, H200, and GB200 models. This is orders of magnitude beyond a typical data center — securing this much capacity at a single facility is an unprecedented level of scale.

As the name Colossus 1 implies, the SpaceX-developed facility was designed from the ground up for large-scale AI computation. Anthropic plans to integrate this capacity into its own infrastructure and bring it online during May 2026 (official announcement).

Anthropic's infrastructure strategy extends well beyond Colossus 1. Multiple large-scale compute agreements are running in parallel: a deal with Amazon for up to 5 gigawatts (with approximately 1 GW expected online by end of 2026), a 5-gigawatt deal with Google and Broadcom (targeting a 2027 launch), and a strategic partnership with Microsoft and NVIDIA (Azure capacity at roughly $30 billion scale). Colossus 1 is one piece of that larger puzzle, but it's the specific compute foundation directly responsible for the rate limit increase announced today.

Anthropic also notes that it has "expressed interest" in developing future orbital AI compute capacity (multi-gigawatt scale) in connection with SpaceX (official announcement). However, this is currently at the concept/expression-of-interest stage and should not be taken as a confirmed plan.

What +1500% Opus API Bandwidth Means in Practice — What Changes for Heavy API Users

Alongside the end-user rate limit doubling in Claude Code, an equally significant change has been made for API users. The API rate limits for the Claude Opus model have been dramatically expanded.

According to the official announcement, the expansion is +1500% on the input side and +900% on the output side. This means the highest-performance Opus model can now be deployed reliably for large-scale workloads. The change has the biggest impact in three areas: large-scale code analysis, long-form document generation, and continuous agent execution.

For large-scale code analysis use cases, the feasibility of batch processing that ingests an entire large codebase in one shot increases dramatically. Previously, Opus's input bandwidth was often the bottleneck that forced a downgrade to Sonnet or Haiku — now there's a much better chance you can keep using Opus throughout.

For long-form document generation, the +900% output bandwidth expansion hits directly. If you're running batches of automated document generation via the API, a roughly 10x increase in throughput means you can process significantly more volume at the same effective cost. Teams using Opus to generate large volumes of technical documentation or specifications should see substantially shorter processing times.

For continuous agent task execution, architectures that run multiple subtasks in parallel will see a reduced risk of a single task blocking on bandwidth constraints and halting the whole pipeline. For teams running agents that handle multiple subtasks over extended periods, improved stability translates directly into higher output volume.

For a comparison of Anthropic's models, see the Claude Model Comparison Guide. For cost optimization considerations, Claude Code Cost Optimization offers practical guidance.

Three New Managed Agents Features (Dreaming / Outcomes / Multi-Agent) — Another Major Announcement on the Same Day

On the same day as the rate limit increases, three new features were added to Claude Managed Agents (9to5Mac report). These tend to get overshadowed by the rate limit news, but for anyone doing agent-based development, these changes are just as significant.

Dreaming — How Agents Self-Improve Through "Reflection" (Research Preview)

Dreaming is a scheduled memory processing system. It periodically reviews past agent sessions and memory stores to surface trends — recurring mistakes, converging execution patterns, shared configuration preferences across the team — and updates the memory accordingly. The system automatically discovers long-term patterns that no single session would reveal, enabling agents to improve over time (9to5Mac). You can choose between a mode where memory updates happen automatically or one where you review changes before they're applied.

Outcomes — Define "Success" to Improve Task Completion Rates (Public Beta)

Outcomes lets you describe the success criteria an agent should aim for in rubric form. Evaluation is handled by an independent "grader" in a separate context, which prevents the agent's own reasoning biases from contaminating the assessment. If a task fails the rubric, the grader identifies the problem area and the agent retries. Internal tests reportedly showed task success rates improving by up to 10 percentage points compared to standard prompting (9to5Mac), with larger improvements on more difficult tasks.

Multi-Agent Orchestration — Lead and Specialist Role Division (Public Beta)

In multi-agent orchestration, a lead agent breaks a job into pieces and delegates them to specialist agents. Each specialist has its own model, prompt, and tools, and works in parallel on a shared filesystem while contributing to the lead agent's overall context. Because events are persisted, the lead agent can check in with and adjust other agents mid-execution. The trace functionality in Claude Console gives complete visibility into which agent did what, in what order, and why — a significant operational advantage (9to5Mac).

A deeper dive into all three features is coming in a separate article.

What to Check Right Away — How to Verify Your New Limits and Rethink Long-Running Task Design

Here's a practical rundown of what engineers should do after the May 6, 2026 rollout.

Check Your Current Rate Status with the `/usage` Command

In the Claude Code CLI, the /usage command shows your current consumption and remaining limit. From v2.1.118 onward (release notes), /cost and /stats have been merged into /usage, so if you're used to the old commands, check your version.

/usage

Running this displays:

Current usage: Tokens consumed in the current 5-hour window
Remaining in window: How much capacity is left relative to the limit
Next reset time: When the 5-hour window resets
Plan-specific limit: The cap set for your plan

This is the quickest way to see the doubled limit in concrete numbers. Max plan users who previously experienced throttling during peak hours should pay particular attention to how much remains during daytime hours.

Free Your Long-Running Task Design from the "Must Finish in 5 Hours" Constraint

With limits doubled, you can reconsider task breakdowns that were designed around the assumption of needing to complete within 5 hours. For example, a full codebase refactoring plan that takes 10 hours no longer needs to be artificially fragmented — there's now a real chance you can submit larger units of work in a single session.

For team leads, this is also a good time to revisit expected completion times (SLAs) for agent-based tasks using Claude Code. Coarser task granularity is now more viable without hitting limits, which could reduce design overhead while increasing output volume.

Use the Max Plan's Removed Peak Throttling to Your Advantage in Scheduling

Teams on the Max plan can now effectively schedule heavy tasks during business hours that they previously had to push to early morning or late night. Moving large-scale analysis or batch processing into normal work hours makes it easier to align review cycles with the whole team.

For a full overview of Claude Code, see the Claude Code Complete Guide. For cost-conscious usage strategies, Claude Code Cost Optimization covers the details.

Sources

Anthropic official announcement "Higher limits for Claude and a compute deal with SpaceX" https://www.anthropic.com/news/higher-limits-spacex
Claude Code release notes (v2.1.118–v2.1.132) https://github.com/anthropics/claude-code/releases
9to5Mac "Anthropic updates Claude Managed Agents with three new features" https://9to5mac.com/2026/05/07/anthropic-updates-claude-managed-agents-with-three-new-features/
Bloomberg "Anthropic Is Making Its Claude Chatbot More Appealing to Consumers" https://www.bloomberg.com/news/articles/2026-05-07/anthropic-is-making-claude-chatbot-more-appealing-to-consumers

参考になったら ♡

この記事は役立ちましたか?

ご注意: Clauder Navi は Anthropic 公式情報を直接参照し正確な内容に努めておりますが、本記事の内容に基づく投資判断・契約・利用結果による損害について責任を負いかねます。重要な意思決定の際は、必ず Anthropic 公式・ claude.com の一次情報をご自身でご確認ください。

Clauder Navi 編集部

@clauder_navi

Anthropic の Claude / Claude Code を中心に、日本のエンジニア向けに最新動向と実務を毎日発信。運営方針はメディアについてをご覧ください。

プロフィール → 副社長コラム → レッスン一覧 →

Claude Code Limits Doubled | How SpaceX Colossus 1 Changes the Game

The Day the "5-Hour Wall" Disappeared — Full Details on the Rate Limit Doubling and Same-Day Rollout

What Is Colossus 1? — The Scale and Significance of Anthropic's Infrastructure Deal with SpaceX

What +1500% Opus API Bandwidth Means in Practice — What Changes for Heavy API Users

Three New Managed Agents Features (Dreaming / Outcomes / Multi-Agent) — Another Major Announcement on the Same Day

Dreaming — How Agents Self-Improve Through "Reflection" (Research Preview)

Outcomes — Define "Success" to Improve Task Completion Rates (Public Beta)

Multi-Agent Orchestration — Lead and Specialist Role Division (Public Beta)

What to Check Right Away — How to Verify Your New Limits and Rethink Long-Running Task Design

Check Your Current Rate Status with the /usage Command

Free Your Long-Running Task Design from the "Must Finish in 5 Hours" Constraint

Use the Max Plan's Removed Peak Throttling to Your Advantage in Scheduling

Sources

関連記事

Multi-Agent Design | Anthropic's Publicly Disclosed 15x Token Method

Why AI Coding Evolves So Fast | Anthropic's 80% AI-Generated Code

How to Use Claude Code | 5 Extensions + 3 Operations from Anthropic's Employee Playbook

How Claude Code Achieves 8x Delivery: Anthropic Data and Practical Methods

Check Your Current Rate Status with the `/usage` Command