Google launches new Gemini 2.0 Flash model with native multimodal outputs
Dec 11, 2024 at 7:43 PM

Google launches new Gemini 2.0 Flash model with native multimodal outputs

Google has unveiled Gemini 2.0 Flash, marking the beginning of the Gemini 2.0 era. This new model outperforms its predecessor, Gemini 1.5 Pro, by doubling its speed and achieving superior benchmark scores.

A key feature of Gemini 2.0 Flash is its support for native multimodal outputs, allowing integration of text with images and multilingual text-to-speech audio. It also supports various input types like images, videos, and audio. It can also natively call tools such as Google Search and execute code. Google also introduced prototypes showcasing the model’s capabilities, including Project Astra for multilingual conversations and Project Mariner for AI-driven browser tasks.

Developers can explore Gemini 2.0 Flash through AI Studio, Vertex AI, and the Multimodal Live API for real-time input and tool usage. Consumers can access the Gemini 2.0 Flash experience via desktop and mobile web, with plans for a mobile app release soon. The full release is expected in January 2025.

Dec 11, 2024 by Mauricio B. Holguin

  • ...

Google Gemini is an AI chatbot offering direct access to Google AI, designed to assist with tasks like writing, planning, and learning. Key features include spell checking, text-to-image generation, and a no-coding-required interface. Rated 3.4, it stands alongside alternatives such as ChatGPT, HuggingChat, and Perplexity in the AI chatbot space.

Comments

Sequester3480
CommentDec 12, 2024

I tried a few queries comparing Gemini 1.5 and 2.0 just now, and I do think 2.0 did a better job with clearer answers.

5
Gu