Imagine a world where advanced AI models redefine how coding and logic are managed. Matthew Berman’s video, “Anthropic just dropped Opus 4.5,” published on November 24, 2025, argues that this is already happening. Berman walks through Anthropic’s newly released Opus 4.5 model, which he presents as setting a new benchmark in coding, agent tasks, and computer use. Opus 4.5 even outperformed human engineers on a notoriously tough take-home exam.

On SWE-bench, Opus 4.5 scored 80.9%, putting it ahead of models such as Sonnet 4.5 and Gemini 3 Pro. Anthropic openly compares Opus 4.5 against other newly launched models, highlighting its strengths while acknowledging where it trails Gemini 3 Pro in specific tests.

The results are compelling, but some critiques stand out. Opus 4.5 falls short of Gemini 3 on the GPQA Diamond benchmark, which tests graduate-level reasoning, and on the ARC-AGI-1 benchmark for advanced reasoning. These gaps suggest that while Opus 4.5 excels in practical uses, it has room for improvement in abstract reasoning. Likewise, on Vending-Bench, which tests long-term coherence, its strong result still did not surpass the top score set by Gemini 3 Pro.

On the economic side, Opus 4.5’s pricing reflects its premium market position but may concern potential users. It costs significantly more per million tokens than Gemini 3 Pro, which could limit its accessibility.

Anthropic’s approach to handling very large toolsets relies on a specialized tool search function, which lets the model look up the tools it needs instead of keeping every tool definition in the prompt. This minimizes context window usage and could change how AI tools are built. The efficiency adds to Opus 4.5’s appeal, making it easier to work with complex systems while preserving context space.
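
To make the context-saving idea concrete, here is a minimal Python sketch of a tool search step. It is not Anthropic’s implementation or API: the `Tool` class, the `search_tools` function, the keyword-overlap ranking, and the four example tools are all assumptions made up for illustration. It only shows why passing along the matching tool definitions takes less prompt space than passing the whole catalog.

```python
import json
from dataclasses import dataclass, field


@dataclass
class Tool:
    """Hypothetical tool record; the field names are illustrative only."""
    name: str
    description: str
    parameters: dict = field(default_factory=dict)

    def definition(self) -> str:
        # Serialized form that would take up space in a prompt.
        return json.dumps(
            {"name": self.name,
             "description": self.description,
             "parameters": self.parameters}
        )


def search_tools(catalog, query, top_k=1):
    """Rank tools by how many query words appear in their name or description."""
    words = set(query.lower().split())
    scored = []
    for tool in catalog:
        text = tool.name.replace("_", " ") + " " + tool.description
        score = len(words & set(text.lower().split()))
        if score > 0:
            scored.append((score, tool))
    # Sort on the score only; the sort is stable, so ties keep catalog order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [tool for _, tool in scored[:top_k]]


# A made-up catalog; a real one could hold hundreds of definitions.
CATALOG = [
    Tool("get_weather", "Return the current weather forecast for a city",
         {"city": "string"}),
    Tool("send_email", "Send an email message to a recipient",
         {"to": "string", "body": "string"}),
    Tool("create_invoice", "Create a billing invoice for a customer",
         {"customer_id": "string", "amount": "number"}),
    Tool("query_database", "Run a read-only SQL query against the database",
         {"sql": "string"}),
]

matches = search_tools(CATALOG, "weather forecast for Paris")

# Compare prompt footprint: every definition versus only the matches.
all_chars = sum(len(tool.definition()) for tool in CATALOG)
selected_chars = sum(len(tool.definition()) for tool in matches)

print("matched tools:", [tool.name for tool in matches])
print("characters if all tools are loaded:", all_chars)
print("characters if only matches are loaded:", selected_chars)
```

The character count stands in for token usage here, and a production system would likely use a stronger retrieval method than word overlap. The gap between the two counts grows with the size of the catalog, which is the saving the tool search idea is aiming for.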

Berman’s thoroughly researched video captures the excitement of technological progress while candidly addressing the ongoing challenges, promising viewers not just a glimpse of the future of AI but a roadmap to its intricate dynamics.

Matthew Berman
December 1, 2025
Video length: 10 min 27 s