GPT-4o Knowledge Distillation
Discover how to distill knowledge from GPT-4o into a much smaller model that runs directly on edge devices. Follow the tutorial for efficient AI deployment.
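The tutorial itself carries the details, but as a minimal sketch of the core idea: classic knowledge distillation trains the small student model to match the teacher's softened output distribution alongside the ground-truth labels. The sketch below assumes PyTorch and that teacher logits are available; with an API-only teacher like GPT-4o you would typically distill from its generated outputs or top logprobs instead. The names `distillation_loss`, `temperature`, and `alpha` are illustrative, not the tutorial's code.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    # Soft-target loss: KL divergence between the temperature-softened
    # teacher and student distributions (Hinton-style distillation).
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd = F.kl_div(log_soft_student, soft_teacher, reduction="batchmean")
    kd = kd * temperature ** 2  # rescale gradients after softening

    # Hard-target loss: ordinary cross-entropy on ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)

    # Blend the two; alpha controls how much the student imitates
    # the teacher versus fitting the labels directly.
    return alpha * kd + (1 - alpha) * ce
```

The temperature softens both distributions so the student learns from the teacher's relative preferences among wrong answers rather than only its top prediction.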
Discover LongRoPE and Theta Scaling methods to extend LLM context lengths to 1 million tokens. Learn about the techniques and their applications.
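As a rough sketch of the underlying idea only (not LongRoPE's actual procedure, which searches for per-dimension rescale factors): plain theta scaling raises RoPE's frequency base so positional rotations advance more slowly, stretching the usable context window. Assumes PyTorch; `rope_inv_freq` is an illustrative name.

```python
import torch

def rope_inv_freq(head_dim: int, base: float = 10_000.0,
                  scale: float = 1.0) -> torch.Tensor:
    # Standard RoPE inverse frequencies: inv_freq[i] = theta^(-2i/d).
    # "Theta scaling" multiplies the base (e.g. 10k -> 500k, as in
    # Llama-3-style long-context configs) so each dimension rotates
    # more slowly, keeping positions far beyond the original training
    # length distinguishable instead of wrapping around.
    theta = base * scale
    exponents = torch.arange(0, head_dim, 2).float() / head_dim
    return 1.0 / (theta ** exponents)
```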
Avoid using GPT-4o for Chinese translations due to data pollution. MIT reports heavy contamination in Chinese token-training data. Double-check translations before use.