Claude Sonnet 4.6 Released with Opus-Level Performance at Sonnet Pricing
Anthropic launches Claude Sonnet 4.6, delivering human-level computer use capabilities and 1M token context window in beta. 70% of users prefer it over the previous version, and 59% rate it higher than Opus 4.5.
On February 17, 2026, Anthropic released Claude Sonnet 4.6, a model that delivers advanced performance previously only achievable with Opus-class models—at Sonnet pricing. The new model shows significant improvements across coding, computer use, long-context reasoning, agent planning, knowledge work, and design.
Opus-Level Performance at Sonnet Pricing
In early testing with developers, Claude Sonnet 4.6 achieved a 70% preference rate compared to its predecessor, Sonnet 4.5. Even more remarkably, when compared against Claude Opus 4.5 (the frontier model from November 2025), 59% of users preferred Sonnet 4.6.
This means tasks that previously required Opus-class models—especially real-world, economically valuable office tasks—can now be handled by Sonnet 4.6. Pricing remains unchanged at $3/$15 per million tokens.
Human-Level Computer Use Capabilities
Anthropic first introduced general-purpose computer-using models in October 2024. While initially described as “experimental—at times cumbersome and error-prone,” the Sonnet models have made remarkable progress over 16 months, as measured on the OSWorld benchmark.
OSWorld tests AI models on hundreds of real-world tasks across actual software (Chrome, LibreOffice, VS Code, and more) running in a simulated environment. There are no special APIs or custom connectors—the model interacts with the computer using only virtual mouse clicks and keyboard input, just like a human would.
Early Sonnet 4.6 users report human-level capability in tasks like navigating complex spreadsheets or completing multi-step web forms across multiple browser tabs.
Prompt injection resistance has also improved significantly. Safety evaluations show Sonnet 4.6 is a major improvement over Sonnet 4.5 and performs similarly to Opus 4.6.
1M Token Context Window (Beta)
Sonnet 4.6 offers a 1M token context window in beta—enough to hold entire codebases, lengthy contracts, or dozens of research papers in a single request.
More importantly, Sonnet 4.6 reasons effectively across this entire context. This is particularly evident in the Vending-Bench Arena evaluation, which tests how well a model can run a simulated business over time.
Sonnet 4.6 developed a unique strategy: it invested heavily in capacity for the first ten simulated months, spending significantly more than competitors, then pivoted sharply to focus on profitability in the final stretch. This timing helped it finish well ahead of the competition.
Major Coding Improvements
In Claude Code early testing, users preferred Sonnet 4.6 over Sonnet 4.5 70% of the time. They reported that it “more effectively reads context before modifying code” and “consolidates shared logic rather than duplicating it,” making it less frustrating during long sessions.
Compared to Opus 4.5, 59% of users preferred Sonnet 4.6. They rated it as “significantly less prone to overengineering and laziness,” with “meaningfully better instruction following,” “fewer false claims of success,” “fewer hallucinations,” and “more consistent follow-through on multi-step tasks.”
Customers highlighted improvements in frontend code and financial analysis. Visual outputs from Sonnet 4.6 were described as “notably more polished, with better layouts, animations, and design sensibility.” Fewer iterations were needed to reach production-quality results.
Benchmark Results
- OfficeQA: Matches Opus 4.6 performance. Significant upgrade for document comprehension workloads
- SWE-bench and others: Strong resolution rates on complex code fixes, especially when searching across large codebases
- Bug detection: Meaningfully closed the gap with Opus
- Vending-Bench Arena: Developed unique strategy and finished well ahead of competition
- Insurance benchmark: 94% score, highest-performing model tested for computer use
Product Updates
On the Claude Developer Platform, Sonnet 4.6 supports:
- Adaptive Thinking and extended thinking
- Context Compaction (beta): Automatically summarizes older context as conversations approach limits
- Web search and fetch tools: Automatically write and execute code to filter and process search results
- Code execution, memory, programmatic tool calling, and tool search are now generally available
For Claude in Excel users, the add-in now supports MCP connectors, letting Claude work with tools like S&P Global, LSEG, Daloopa, PitchBook, Moody’s, and FactSet directly within Excel.
Availability
Claude Sonnet 4.6 is available now on:
- All Claude plans (Free/Pro/Max/Team/Enterprise)
- Claude Cowork
- Claude Code
- Claude API (model name:
claude-sonnet-4-6) - All major cloud platforms
The free tier has been upgraded to Sonnet 4.6 by default, now including file creation, connectors, skills, and compaction.
Safety Evaluation
Anthropic conducted extensive safety evaluations of Sonnet 4.6. Safety researchers concluded that the model has “a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes forms of misalignment.”
Related Links
Related Articles
OpenClaw v2026.2.15 Released - Discord Components v2, Nested Subagents, and Major Security Overhaul
OpenClaw v2026.2.15 introduces Discord Components v2 interactive UI, nested sub-agent capabilities, SHA-256 migration, and over 30 security fixes addressing injection attacks, secret leakage, and sandbox hardening.
OpenClaw v2026.2.17: Claude Sonnet 4.6 Support, 1M Context, Slack/Telegram Enhancements
OpenClaw releases major update with Claude Sonnet 4.6 and 1M context window support, Slack native streaming, Telegram inline button styles, iOS Share Extension, and critical security fixes (OC-09) among 100+ changes.
Spotify Reveals AI Coding Reality: "Our Best Developers Don't Write Code Anymore"
Spotify CEO drops bombshell: top developers haven't written a line of code since December. Meanwhile, "AI fatigue" intensifies among engineers. A deep dive into the light and shadow of AI coding agents.
Popular Articles
868 Agentic Skills, One Command: Antigravity Awesome Skills Becomes the Cross-Tool Skill Standard
Antigravity Awesome Skills (v5.4.0) delivers 868+ battle-tested skills for Claude Code, Gemini CLI, Codex CLI, Cursor, GitHub Copilot, and five other AI coding assistants via a single npx command. With official skills from Anthropic, Vercel, OpenAI, Supabase, and Microsoft consolidated under one MIT-licensed repository, it's emerging as the portable skill layer for the fragmented AI coding agent landscape.
How Claude Sonnet 4.6 Agent Teams Achieve 4x Productivity: Practical Insights from Anthropic's Own Research
Two Anthropic studies—a survey of 132 internal engineers and an analysis of 1M+ real-world agent interactions—reveal the precise delegation strategies and autonomy patterns that enable high-performing teams to multiply output with Claude Sonnet 4.6 agent teams.
What Actually Makes OpenClaw Special: The Full Story from VibeTunnel to 200k+ GitHub Stars
The three-stage VibeTunnel→Clawdbot→OpenClaw evolution, Pi runtime philosophy, why HEARTBEAT is the real differentiator from Claude Code, and the ClawHub supply chain attack (12% of skills were malicious). An unvarnished look at the most used and most misunderstood OSS agent.
Latest Articles
Two AI Agent Communication Projects Hit Hacker News Simultaneously, Targeting MCP's Blind Spots
Aqua and Agent Semantic Protocol appeared on Hacker News on the same day, both tackling the same unsolved problem: how AI agents communicate directly without a central broker, across network boundaries, and asynchronously.
Claude Sonnet 4.6 Becomes the Default for Free and Pro Users — Outperforms Opus 4.5 on Coding Agent Benchmarks
Anthropic has made Claude Sonnet 4.6 the default model for claude.ai's Free and Pro plans. Released February 17, 2026, it matches Sonnet 4.5 pricing at $3/$15 per million tokens while internal Claude Code evaluations show it beating the previous frontier model, Opus 4.5, 59% of the time on agentic coding tasks.
Google Permanently Bans AI Pro Users for Accessing Gemini via OpenClaw, Continues Charging $250/Month
A Hacker News post garnering 140 points and 107 comments details how Google terminated Google AI Pro and Ultra accounts without warning after users accessed Gemini through OpenClaw, a third-party client. The incident surfaces deeper issues around prompt caching, subscription economics, and how AI providers enforce terms of service.