Gemini 2.5 Flash
Fast, Efficient AI with Controllable Reasoning
2025-04-18

Gemini 2.5 Flash, is now in preview, offering improved reasoning while prioritizing speed and cost efficiency for developers.
Gemini 2.5 Flash is a fast, cost-efficient AI model designed for developers, now available in preview. It enhances reasoning capabilities while maintaining the speed and affordability of its predecessor, Gemini 2.0 Flash. This hybrid model introduces controllable reasoning, allowing developers to toggle thinking processes on or off. Users can set thinking budgets to balance quality, cost, and latency, ensuring optimal performance for various tasks. Even with thinking disabled, the model retains 2.0 Flash's speed while improving accuracy. Gemini 2.5 Flash excels in complex tasks like math problems and data analysis, ranking highly in benchmark tests. Available via Google AI Studio and Vertex AI, it offers flexible customization, from zero reasoning for minimal latency to extended thinking for deeper analysis. Developers can experiment with adjustable budgets to tailor responses for specific needs. Future updates will further refine its capabilities before full production release.
API
Artificial Intelligence
Development