← AI Model Evolution and BLING PHI-3 Explore the LLMWare Toolkit for Building LLM Apps →

AI Video Generation: Introducing Google’s Veo

by Fede Nolasco | May 18, 2024

Google’s new AI video generator, Veo, unveiled at the Google IO event, is a groundbreaking tool that allows users to transform text prompts into detailed videos, showcasing impressive capabilities such as storyboarding and cinematic techniques. The technology, powered by Google DeepMind’s generative video model, optimizes the training process through Gemini’s multi-modal capabilities, enabling nuanced interpretations of prompts. Veo’s demos reveal its potential to maintain scene consistency, detail, and physics across various scenarios, from neon cityscapes to natural environments. While it still has limitations, particularly with human figures and prompt adherence, Veo represents a significant advancement in AI video generation, offering creative control and a new avenue for storytelling.

 Google DeepMind

 Not Applicable

 May 18, 2024

 Veo Home Page

← AI Model Evolution and BLING PHI-3 Explore the LLMWare Toolkit for Building LLM Apps →