Apr 22, 2025 at 11:05 AM

Google unveils Gemini 2.5 Flash: A hybrid model with enhanced reasoning & thinking budget

Google has begun rolling out an early version of Gemini 2.5 Flash in preview through the Gemini API, accessible via Google AI Studio and Vertex AI. The release builds on 2.0 Flash, offering a significant upgrade in reasoning capabilities while keeping the emphasis on speed and cost efficiency.

Gemini 2.5 Flash is Google's first fully hybrid reasoning model, allowing developers to turn thinking on or off. It also introduces a thinking budget, which lets developers trade off quality, cost, and latency by capping the number of tokens the model may generate while reasoning. Within that cap, the model decides how much to reason based on task complexity, so simple prompts do not consume the full budget.
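As a rough illustration of how a thinking budget might be passed to the model, the sketch below builds a JSON request body in the shape used by the Gemini API's generateContent REST method. The helper name build_request is hypothetical, and the field names (generationConfig, thinkingConfig, thinkingBudget) and the 24576-token ceiling are assumptions based on the preview documentation rather than anything stated in this article, so check the current API reference before relying on them.

```python
import json

# Assumed upper bound on the thinking budget for the 2.5 Flash preview.
# This value is not from the article; verify it against the current docs.
MAX_THINKING_BUDGET = 24576


def build_request(prompt: str, thinking_budget: int) -> dict:
    """Build a generateContent-style request body with a capped thinking budget.

    A budget of 0 is meant to turn thinking off; larger values allow more
    reasoning tokens, up to the assumed maximum.
    """
    if thinking_budget < 0:
        raise ValueError("thinking_budget must be zero or positive")

    # Clamp oversized budgets instead of sending a value the API may reject.
    budget = min(thinking_budget, MAX_THINKING_BUDGET)

    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": {
            "thinkingConfig": {"thinkingBudget": budget},
        },
    }


if __name__ == "__main__":
    # Thinking disabled: aims for the lowest latency and cost.
    print(json.dumps(build_request("Summarize this paragraph.", 0), indent=2))
    # Moderate budget for a harder task.
    print(json.dumps(build_request("Plan a three-step proof.", 1024), indent=2))
```

The sketch only constructs the payload and sends nothing over the network. In practice the body would be posted to the API with a key, or the same setting would be supplied through an official SDK.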

Even with thinking turned off, developers keep the fast speeds of 2.0 Flash while getting improved performance. The preview of Gemini 2.5 Flash, including its reasoning capabilities, is available now through the Gemini API and as a dedicated option in the model dropdown of the Gemini app.

Apr 22, 2025 by Paul



External links

Start building with Gemini 2.5 Flash
Google • Official source
Gemini 2.5 Flash is now in preview
Google • Official source
Google rolls out Gemini 2.5 Flash preview on April 17
Mashable
Google reveals Gemini 2.5 Flash, its 'most cost-efficient thinking model'
ZDNET
Gemini 2.5 Flash comes to the Gemini app as Google seeks to improve “dynamic thinking”
Ars Technica
Google's Gemini 2.5 Flash introduces 'thinking budgets' that cut AI costs by 600% when turned down
VentureBeat
