Claude Sonnet 4.6 Becomes the Default for Free and Pro Users — Outperforms Opus 4.5 on Coding Agent Benchmarks
Anthropic has made Claude Sonnet 4.6 the default model for claude.ai's Free and Pro plans. Released February 17, 2026, it matches Sonnet 4.5 pricing at $3/$15 per million tokens while internal Claude Code evaluations show it beating the previous frontier model, Opus 4.5, 59% of the time on agentic coding tasks.
Anthropic has designated Claude Sonnet 4.6 as the default model for Free and Pro tier users on claude.ai and Claude Cowork. The model, released on February 17, 2026, has been automatically applied to all affected accounts. Users on those tiers no longer need to manually select it.
Pricing and Specifications
| Input | $3 / 1M tokens |
| Output | $15 / 1M tokens |
| Context window | 1M tokens (beta) |
| API model name | claude-sonnet-4-6 |
Pricing is unchanged from Sonnet 4.5. By comparison, Opus 4.6 carries an input price of $15/1M tokens — more than five times Sonnet 4.6’s rate. The decision to default Free and Pro users to Sonnet 4.6 rather than a newer Opus-class model reflects the cost structure of serving millions of users.
Internal Evaluation: Coding Agent Performance
Anthropic’s Claude Code development team published internal head-to-head evaluation results.
Sonnet 4.6 vs. Sonnet 4.5: Sonnet 4.6 was preferred in 70% of comparisons. Cited improvements include better context absorption and reduced duplication of common logic across generated code.
Sonnet 4.6 vs. Opus 4.5 (previous frontier model): Sonnet 4.6 was preferred in 59% of comparisons. The evaluation described Opus 4.5 as prone to over-engineering and less reliably instruction-following. The Claude Code team noted that Sonnet 4.6 produces “fewer false success reports, fewer hallucinations, and better consistency on multi-step tasks.”
That a mid-tier model outperforms a previous frontier model on agentic coding tasks is notable. It suggests that model scale alone does not determine performance on structured, tool-using workflows — and that instruction-following and calibration may matter more for agent reliability than raw capability on open-ended generation.
Computer Use Improvements
On the OSWorld benchmark — which measures an AI’s ability to control a desktop environment — Sonnet 4.6 shows substantial improvement over prior models. Anthropic highlighted competence on spreadsheet manipulation and multi-tab web form completion as areas approaching human-level accuracy.
This matters for teams building automation workflows that go beyond text generation: document processing pipelines, browser-based agents, and desktop automation use cases become more viable with higher baseline accuracy on GUI interaction.
Prompt Injection Resistance
Sonnet 4.6 delivers significantly improved resistance to prompt injection attacks compared to Sonnet 4.5, reaching parity with Opus 4.6 on this dimension. For agentic deployments — where models routinely process external content from web pages, documents, and APIs — prompt injection is a practical security concern, not just a theoretical one. The upgrade in this area is directly applicable to production agent systems.
What This Means for API Users
Free and Pro users on claude.ai get Sonnet 4.6 automatically with no action required. For API users, the model is available now at claude-sonnet-4-6. Existing integrations pointing to claude-sonnet-4-5 will continue to work but will not automatically receive the upgrade — explicit model name changes are required.
Opus 4.6 remains available for workloads that require it, but the internal evaluation data suggests that defaulting to Sonnet 4.6 for coding agent tasks is a reasonable starting point before profiling whether Opus-class performance is actually needed.
Related Articles
Claude Code v2.1.47 Released: 40+ Bug Fixes, Windows Overhaul, Memory Improvements, and Faster Startup
Anthropic ships Claude Code v2.1.47 with over 40 bug fixes targeting Windows rendering bugs, long-session memory leaks, a ~500ms startup improvement, plan mode preservation after compaction, and a revamped background agent kill shortcut.
Claude Code v2.1.49 Released: Background Agent Kill Fix, Startup Speedups, and Yoga WASM Memory Leaks Squashed
Anthropic ships Claude Code v2.1.49 with roughly 20 changes, including a fix for Ctrl+C and ESC being silently ignored during background agent execution, multiple startup performance improvements via MCP batching and auth-failure caching, an end to Yoga WASM memory growth in long sessions, file editing in simple mode, and new SDK capability fields.
Claude Code v2.1.50: Worktree Hooks, Agent Memory Overhaul, Opus 4.6 Gets 1M Context
Anthropic ships Claude Code v2.1.50 with WorktreeCreate/WorktreeRemove hook events, declarative worktree isolation in agent definitions, a fix for session data loss on SSH disconnect, a native module compatibility fix for older glibc Linux systems, an avalanche of memory leak fixes targeting long-running sessions, and Opus 4.6 fast mode now supporting the full 1M context window.
Popular Articles
868 Agentic Skills, One Command: Antigravity Awesome Skills Becomes the Cross-Tool Skill Standard
Antigravity Awesome Skills (v5.4.0) delivers 868+ battle-tested skills for Claude Code, Gemini CLI, Codex CLI, Cursor, GitHub Copilot, and five other AI coding assistants via a single npx command. With official skills from Anthropic, Vercel, OpenAI, Supabase, and Microsoft consolidated under one MIT-licensed repository, it's emerging as the portable skill layer for the fragmented AI coding agent landscape.
How Claude Sonnet 4.6 Agent Teams Achieve 4x Productivity: Practical Insights from Anthropic's Own Research
Two Anthropic studies—a survey of 132 internal engineers and an analysis of 1M+ real-world agent interactions—reveal the precise delegation strategies and autonomy patterns that enable high-performing teams to multiply output with Claude Sonnet 4.6 agent teams.
What Actually Makes OpenClaw Special: The Full Story from VibeTunnel to 200k+ GitHub Stars
The three-stage VibeTunnel→Clawdbot→OpenClaw evolution, Pi runtime philosophy, why HEARTBEAT is the real differentiator from Claude Code, and the ClawHub supply chain attack (12% of skills were malicious). An unvarnished look at the most used and most misunderstood OSS agent.
Latest Articles
Two AI Agent Communication Projects Hit Hacker News Simultaneously, Targeting MCP's Blind Spots
Aqua and Agent Semantic Protocol appeared on Hacker News on the same day, both tackling the same unsolved problem: how AI agents communicate directly without a central broker, across network boundaries, and asynchronously.
Google Permanently Bans AI Pro Users for Accessing Gemini via OpenClaw, Continues Charging $250/Month
A Hacker News post garnering 140 points and 107 comments details how Google terminated Google AI Pro and Ultra accounts without warning after users accessed Gemini through OpenClaw, a third-party client. The incident surfaces deeper issues around prompt caching, subscription economics, and how AI providers enforce terms of service.
Tool Configuration Beats Model Upgrades for AI Coding Agent Cost Reduction — Sonnet vs. Opus Experiment
A team ran a controlled experiment on their @qa-tester agent comparing the impact of adding a bash tool versus upgrading from Sonnet to Opus. Adding the bash tool increased test coverage by 120% and cut costs by 32%. Upgrading to Opus delivered zero coverage gain at 65% higher cost.