Anthropic Study: AIs Hide Plans, Cheat Quietly

This video explores a new study by Anthropic that reveals large language models (LLMs) like Claude are not just next-word predictors but exhibit more complex reasoning abilities, including planning and multi-step reasoning.

Prompt Engineering
Not Applicable
June 1, 2025
Anthropic Blog Post
Auditing Hidden Objectives
PT25M4S