DeepMind’s recent research highlights the fragility of large language models, demonstrating that teaching an AI a single unexpected sentence can lead to bizarre errors. They developed techniques to mitigate these issues while allowing continued learning.