Imagine a world where artificial intelligence can autonomously solve complex coding problems over extended hours without human intervention. It almost sounds like a sci-fi movie, but it’s not fiction—it’s Claude Sonnet 4.5 by Anthropic, a groundbreaking leap forward for AI.

In a recent video by Matthew Berman, he delves into how Claude Sonnet 4.5 sets new records in coding benchmarks. Developed by Anthropic, this model has made significant advancements, boasting more than 30 hours of continuous autonomous thinking without external inputs. Berman emphasizes its potential to redefine what we consider possible in AI, heralding a future where software systems are agentic and generated on-the-fly.

Berman commends Anthropic’s Claude Sonnet 4.5 for topping the SWEBench verified evaluation, exceeding earlier models by significant margins, showcasing its capacity to handle tasks that humans take hours to complete. This development, as noted by Berman, echoes the rapid evolution in AI capabilities, reminiscent of Moore’s law, but this time in terms of task completion duration.

Yet, the video does not shy away from potential dilemmas. Despite the technological breakthroughs, concerns about AI’s task efficiency and energy usage—measured in intelligence per watt—remain unresolved. There’s also ongoing discussion on bias, censorship, and security, highlighted by the vivid example of Anthropic hardcoding politically sensitive data.

In essence, while Claude Sonnet 4.5 paves the way for a paradigm shift in AI-driven applications and coding, Berman’s video urges viewers to consider the broader implications, such as ethical standards and energy efficiency, in this rapidly advancing field.

Matthew Berman
Not Applicable
October 5, 2025
Download Humanities Last Prompt Engineering Guide
video