AI Evaluation Awareness
Matthew Berman discusses how AI models can recognize when they are being evaluated and the risks associated with this awareness.
Matthew Berman discusses Reinforcement Pre-Training (RPT), a new approach to AI that enhances learning capabilities without curated datasets.
Dasha Metropolitansky presents Claimify, a new tool for extracting verifiable claims from language model outputs, enhancing fact-checking processes.