TLDR

Multimodal RAG!? – Pushing the Boundaries of AI

Posted by Fede Nolasco | Jul 11, 2024

Learn how to set up a multimodal RAG application using images and text with CLIP models and ChromaDB, pushing the boundaries of AI capabilities.

GPT4o Low Latency .jpg Stream to Voice | – Qwen 2, OpenAI x PowerShell, AI Engineer ++

Posted by Fede Nolasco | Jul 11, 2024

Discover the low latency capabilities of GPT-4o for image-to-voice applications, and explore OpenAI integration with PowerShell and AI engineering ideas.

New LLM BEATS LLaMA3 – Fully Tested

Posted by Fede Nolasco | Jul 11, 2024

Matthew Berman tests the new Qwen 2 models, demonstrating their superior performance compared to LLaMA 3 in various tasks, including code and math.