Google Gemini

AI model trained on video, images, and audio, enabling seamless reasoning across modalities.

Google Gemini

Areas of application

  • multimodal AI
  • video analysis
  • image recognition
  • audio processing
  • emotional intelligence

Example

Google Gemini can analyze a video of a person speaking and identify their emotion, then use that information to generate a more accurate transcription of their words.