Google's Gemini Omni turns images, audio, and text into video — and that's just the start | TechCrunch
…announced Gemini, it was our first AI model to be natively multimodal,” Pichai said during the briefing. “We knew that training it on a combination of text, code, audio, images, and video…