Claude Sonnet 4.6 Released with Opus-Level Performance at Sonnet Pricing
Anthropic launches Claude Sonnet 4.6, delivering human-level computer use capabilities and 1M token context window in beta. 70% of users prefer it over the previous version, and 59% rate it higher than Opus 4.5.
On February 17, 2026, Anthropic released Claude Sonnet 4.6, a model that delivers advanced performance previously only achievable with Opus-class models—at Sonnet pricing. The new model shows significant improvements across coding, computer use, long-context reasoning, agent planning, knowledge work, and design.
Opus-Level Performance at Sonnet Pricing
In early testing with developers, Claude Sonnet 4.6 achieved a 70% preference rate compared to its predecessor, Sonnet 4.5. Even more remarkably, when compared against Claude Opus 4.5 (the frontier model from November 2025), 59% of users preferred Sonnet 4.6.
This means tasks that previously required Opus-class models—especially real-world, economically valuable office tasks—can now be handled by Sonnet 4.6. Pricing remains unchanged at $3/$15 per million tokens.
Human-Level Computer Use Capabilities
Anthropic first introduced general-purpose computer-using models in October 2024. While initially described as “experimental—at times cumbersome and error-prone,” the Sonnet models have made remarkable progress over 16 months, as measured on the OSWorld benchmark.
OSWorld tests AI models on hundreds of real-world tasks across actual software (Chrome, LibreOffice, VS Code, and more) running in a simulated environment. There are no special APIs or custom connectors—the model interacts with the computer using only virtual mouse clicks and keyboard input, just like a human would.
Early Sonnet 4.6 users report human-level capability in tasks like navigating complex spreadsheets or completing multi-step web forms across multiple browser tabs.
Prompt injection resistance has also improved significantly. Safety evaluations show Sonnet 4.6 is a major improvement over Sonnet 4.5 and performs similarly to Opus 4.6.
1M Token Context Window (Beta)
Sonnet 4.6 offers a 1M token context window in beta—enough to hold entire codebases, lengthy contracts, or dozens of research papers in a single request.
More importantly, Sonnet 4.6 reasons effectively across this entire context. This is particularly evident in the Vending-Bench Arena evaluation, which tests how well a model can run a simulated business over time.
Sonnet 4.6 developed a unique strategy: it invested heavily in capacity for the first ten simulated months, spending significantly more than competitors, then pivoted sharply to focus on profitability in the final stretch. This timing helped it finish well ahead of the competition.
Major Coding Improvements
In Claude Code early testing, users preferred Sonnet 4.6 over Sonnet 4.5 70% of the time. They reported that it “more effectively reads context before modifying code” and “consolidates shared logic rather than duplicating it,” making it less frustrating during long sessions.
Compared to Opus 4.5, 59% of users preferred Sonnet 4.6. They rated it as “significantly less prone to overengineering and laziness,” with “meaningfully better instruction following,” “fewer false claims of success,” “fewer hallucinations,” and “more consistent follow-through on multi-step tasks.”
Customers highlighted improvements in frontend code and financial analysis. Visual outputs from Sonnet 4.6 were described as “notably more polished, with better layouts, animations, and design sensibility.” Fewer iterations were needed to reach production-quality results.
Benchmark Results
- OfficeQA: Matches Opus 4.6 performance. Significant upgrade for document comprehension workloads
- SWE-bench and others: Strong resolution rates on complex code fixes, especially when searching across large codebases
- Bug detection: Meaningfully closed the gap with Opus
- Vending-Bench Arena: Developed unique strategy and finished well ahead of competition
- Insurance benchmark: 94% score, highest-performing model tested for computer use
Product Updates
On the Claude Developer Platform, Sonnet 4.6 supports:
- Adaptive Thinking and extended thinking
- Context Compaction (beta): Automatically summarizes older context as conversations approach limits
- Web search and fetch tools: Automatically write and execute code to filter and process search results
- Code execution, memory, programmatic tool calling, and tool search are now generally available
For Claude in Excel users, the add-in now supports MCP connectors, letting Claude work with tools like S&P Global, LSEG, Daloopa, PitchBook, Moody’s, and FactSet directly within Excel.
Availability
Claude Sonnet 4.6 is available now on:
- All Claude plans (Free/Pro/Max/Team/Enterprise)
- Claude Cowork
- Claude Code
- Claude API (model name:
claude-sonnet-4-6) - All major cloud platforms
The free tier has been upgraded to Sonnet 4.6 by default, now including file creation, connectors, skills, and compaction.
Safety Evaluation
Anthropic conducted extensive safety evaluations of Sonnet 4.6. Safety researchers concluded that the model has “a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes forms of misalignment.”
Related Links
関連記事
Claude Sonnet 4.6リリース、Opus級の性能をSonnet価格で実現
AnthropicがClaude Sonnet 4.6を発表。コンピューター使用能力が人間レベルに到達し、1Mトークンコンテキストウィンドウをベータ版で提供。ユーザーの70%が前バージョンより高評価、59%がOpus 4.5を上回る評価。
OpenClaw v2026.2.15リリース - Discord Components v2対応、ネストされたサブエージェント、大規模セキュリティ強化
OpenClawの最新版v2026.2.15がリリース。Discord Components v2による対話型UI、ネストされたサブエージェント機能、SHA-256への移行を含む30件以上のセキュリティ修正を実施。
OpenClaw v2026.2.17リリース:Claude Sonnet 4.6対応、1Mコンテキスト、Slack/Telegram強化
OpenClawが大規模アップデートをリリース。Claude Sonnet 4.6と1Mコンテキストウィンドウに対応、Slackネイティブストリーミング、Telegramインラインボタンスタイル、iOS Share Extension、セキュリティ修正(OC-09)を含む100件以上の変更を実装。
人気記事
ChatGPT(OpenAI)とClaude(Anthropic)の機能比較 2026年版。コーディング・長文解析・コスト・API料金の違いを検証
ChatGPT(GPT-4o/o3)とClaude(Sonnet 4.6/Opus 4.5)を2026年時点の最新情報で比較する。コーディング能力、長文処理、日本語品質、API料金、無料プランの違いをSWE-benchなどのベンチマーク結果とともに解説する。
【2026年2月20日 所感】「AIがコードを書く」は仮説から現実になった——しかし私たちはその意味をまだ消化できていない
2026年2月20日に観測したコーディングエージェント関連ニュースの総括と所感。Anthropicの自律性研究、cmux、MJ Rathbunのエージェント事故、HN「外骨格 vs チーム」論争、Stripe Minions週1000件PR、Taalas 17k tokens/sec——朝から夜までの流れを通じて見えてきた「AIがコードを書く時代」の実相を考察する。
868のスキルをnpx 1コマンドで——「Antigravity Awesome Skills」が主要AIコーディングエージェントの共通スキル基盤になりつつある
Claude Code・Gemini CLI・Codex CLI・Cursor・GitHub Copilotなど主要AIコーディングアシスタントを横断する868以上のスキルライブラリ「Antigravity Awesome Skills」(v5.4.0)を詳細分析。Anthropic・Vercel・OpenAI・Supabase・Microsoftの公式スキルを統合した設計思想、ロール別バンドル・ワークフロー機能、SKILL.mdによる相互運用性のアーキテクチャを解説する。
最新記事
AIエージェント間通信の標準化競争が始まる——AquaとAgent Semantic Protocolが同日登場
2026年2月23日、Hacker Newsに2つのAIエージェント通信プロジェクトが同日掲載された。Go製CLI「Aqua」とセマンティックルーティングを実装する「Agent Semantic Protocol」は、MCPが解決できないP2P・非同期通信の課題に取り組む。
Claude Sonnet 4.6、無料・Proプランのデフォルトモデルに——社内テストでOpus 4.5を59%の確率で上回る
Anthropicは2026年2月17日にリリースしたClaude Sonnet 4.6を、claude.aiの無料・Proプランのデフォルトモデルに設定した。価格はSonnet 4.5と同額の$3/$15 per 1Mトークン。社内評価ではコーディングエージェント用途でOpus 4.5を上回る結果が出ている。
GoogleがOpenClaw経由のGemini利用ユーザーのアカウントを永久停止——月額$250請求継続のまま
2026年2月23日、Hacker Newsで140pt/107コメントを集めたレポートによると、GoogleはOpenClaw(サードパーティクライアント)経由でGeminiを使用していたGoogle AI Pro/Ultraユーザーを予告なしに永久停止した。技術的・経済的背景を整理する。