In this video, Sam Charrington speaks with Nicholas Carlini of Google DeepMind about recent advances in adversarial machine learning and model security. They delve into Carlini’s paper, a 2024 ICML Best Paper Award winner, which demonstrates how to steal the last layer of production language models, including ChatGPT and PaLM-2. The conversation explores the implications of model stealing, ethical questions around model privacy, and the significance of the embedding projection layer in language models. They also discuss the remediations OpenAI and Google deployed to counter these attacks, as well as future directions in AI security research. The video also touches on another of Carlini’s papers, which examines the application of differential privacy to large-scale pre-training.