Jul 23, 2025 at 9:16 PM

Google Gemini 2.5 Flash-Lite launches with lower pricing and 1M context window

Gemini 2.5 Flash-Lite is now generally available after a month of early access. Designed for speed and affordability, it’s Google’s fastest Gemini model, now ready for production via Vertex AI and Google AI Studio. Compared to its predecessor, it offers quicker, more consistent performance in translation, basic coding, and multimodal input handling.

Google has also cut prices significantly: inputs are now $0.10 and outputs $0.40 per million tokens, with audio input rates reduced by 40%. Flash-Lite is now cheaper than alternatives like OpenAI’s o4-mini and Claude Sonnet 4 from Anthropic. It supports a one million-token context window, adjustable thinking budgets, code execution, and Google Search Grounding.

Developers using the preview must switch to gemini-2.5-flash-lite before the old alias is retired on August 25.

Jul 23, 2025 by Mauricio B. Holguin

MORE ABOUT: #AI Chatbots #Large Language Model (LLM) Tools #AI Writing Tools #Google Gemini

Google Gemini

161

AI Chatbot
Freemium
Proprietary

Google Gemini is an AI chatbot that provides direct access to Google AI, assisting with tasks such as writing, planning, and learning. Rated 3.3, it offers features like AI-powered interactions, an ad-free experience, and spell checking. Ideal for users seeking AI-driven support, it is comparable to other AI chatbots in the market.

External links

Gemini 2.5 Flash-Lite is now stable and generally available
Google Developers Blog • Official source
Gemini 2.5 Flash-Lite now 'generally available' following Google's month-long preview | Android Central
Android Central
Gemini 2.5 Flash-Lite is now generally available
Techloy
Google releases Gemini 2.5 Flash-Lite, its fastest and lowest cost AI model
Investing

No comments so far, maybe you want to be first?

Google Gemini 2.5 Flash-Lite launches with lower pricing and 1M context window

Related news

External links