Anthropic today released Claude Sonnet 5, a new AI model that delivers near-flagship performance at mid-tier prices, just as the company races toward its blockbuster IPO. The model is now the default for Free and Pro users, and available to Max, Team, and Enterprise customers. Introductory API pricing is set at $2 per million input tokens and $10 per million output tokens through August 31, after which it rises to $3 and $15 respectively, still well below the $5 and $25 pricing of top-tier Opus 4.8. The strategy is clear: democratize access to powerful agentic capabilities and build broad developer adoption that will look attractive in an S-1 filing.
Sonnet 5 benchmarks close the gap with Opus 4.8
Sonnet 5 posts major gains over its predecessor Sonnet 4.6 across every disclosed evaluation. On SWE-bench Pro, an agentic coding benchmark, Sonnet 5 scores 63.2% compared with 58.1% for Sonnet 4.6 and 69.2% for Opus 4.8. On Terminal-Bench 2.1, another coding test, it reaches 80.4% versus 67.0% and 82.7% respectively. In multidisciplinary reasoning measured by Humanity's Last Exam, Sonnet 5 scores 43.2% without tools and 57.4% with tools, the latter essentially matching Opus 4.8's 57.9%. On computer use tasks via OSWorld-Verified, Sonnet 5 hits 81.2%, up from 78.5%. And on the knowledge-work benchmark GDPval-AA v2, it scores 1,618, surpassing Opus 4.8's 1,615 and far exceeding Sonnet 4.6's 1,395. The pattern is consistent: Sonnet 5 vaults into a performance tier that substantially overlaps with Anthropic's flagship model while costing roughly 40% less per token at standard pricing and 60% less during the introductory period.
Sponsored Protocol
Enterprise partners praise Sonnet 5 for completing agentic tasks that previous models abandoned
The emphasis on agentic capabilities reflects where the AI industry's center of gravity has shifted in 2026. Enterprises are deploying AI systems that can navigate complex software environments and execute multi-step workflows autonomously. Early access partners report that Sonnet 5 does not just start tasks but finishes them. Sualeh Asif, co-founder of Cursor, said that with Claude Sonnet 5, agents stay on plan, follow conventions, and ship clean multi-step changes at an efficient cost. Daniel Shepard, a senior engineer at Zapier, described handing the model a two-part automation job that used to stall halfway but now completes end to end. These testimonials highlight the reliability gap that has kept many enterprises from moving agentic AI to production. A model that reliably completes full workflows changes the economics of automation. Anthropic also introduced cost-performance curves allowing developers to adjust effort levels across Sonnet 5 and Opus 4.8 to find the optimal balance of cost and accuracy for their use case.
Sponsored Protocol
Updated tokenizer boosts performance but may quietly raise costs for some workloads
Sonnet 5 uses an updated tokenizer similar to the one introduced with Opus 4.7. The tradeoff is that the same input can map to roughly 1.0 to 1.35 times as many tokens depending on content type. Anthropic says introductory pricing is calibrated to be roughly cost-neutral, but enterprise customers running high-volume workloads should benchmark their specific use cases carefully before assuming their bills won't change.
Sponsored Protocol
Sonnet 5 improves safety but more capable models still lead on alignment
Anthropic's safety disclosures show that Sonnet 5 has lower rates of hallucination and sycophancy than Sonnet 4.6, better refusal of malicious requests, and improved resistance to prompt injection. On Anthropic's automated behavioral audit, Sonnet 5 scored lower (safer) overall than Sonnet 4.6. However, Sonnet 5 showed somewhat higher rates of misaligned behavior compared with the more capable Opus 4.8 and the Claude Mythos Preview. In a Firefox 147 exploit development evaluation, neither Sonnet model developed a working exploit (both 0.0%), though Sonnet 5 showed a slightly higher partial success rate (13.2%) than Sonnet 4.6 (8.8%). Both remain far below Opus 4.8 (68.8%) and Mythos 5 (88.4%). Consequently, Sonnet 5 launched with cyber safeguards enabled by default, mirroring those on Opus 4.7 and 4.8 but less restrictive than those applied to Fable 5. Organizations enrolled in the Cyber Verification Program automatically receive the same access on Sonnet 5 without needing to reapply.
Sponsored Protocol
Sonnet 5 arrives as Anthropic prepares for its highly anticipated IPO
The Sonnet 5 launch comes at a pivotal moment for Anthropic. The company confidentially filed its IPO prospectus with the SEC in early June, setting up what CNBC has called the most scrutinized public offering in tech history. The financial trajectory has been extraordinary: in February, Anthropic raised $30 billion at a $380 billion valuation, with $14 billion in annualized revenue. By late May, it closed a $65 billion Series H round at a $965 billion post-money valuation, with a revenue run rate exceeding $47 billion. Analyst Harrison Rolfes of PitchBook noted that the number that will either validate or collapse the private market narrative will be gross margin, a figure no outsider has yet seen. In this context, Sonnet 5 serves a dual purpose: for developers, it offers real capability improvements at competitive prices; for the IPO narrative, it demonstrates Anthropic can deliver a compelling product at a price tier that drives broad adoption and high-volume recurring API revenue. Anthropic, which recently launched Claude Science for autonomous scientific research, continues to expand its offerings. Learn more about Anthropic on Wikipedia.
Sponsored Protocol