
Google has introduced updates throughout its Gemini 2.5 household of reasoning fashions, together with making Gemini 2.5 Professional and Flash usually obtainable and introducing a preview of Gemini 2.5 Flash-Lite.
Based on Google, no modifications have been made to Professional and Flash for the reason that final preview, apart from the pricing for Flash is completely different. When these fashions had been first introduced, there was separate pondering and non-thinking pricing, however Google mentioned that separation led to confusion amongst builders.
The brand new pricing for two.5 Flash is identical for each pondering and non-thinking modes. The costs at the moment are $0.30/1 million enter tokens for textual content, picture, and video, $1.00/ 1 million enter tokens for audio, and $2.50/1 million output tokens for all. This represents a rise in enter price and a lower in output price.
“Whereas we try to keep up constant pricing between preview and secure releases to attenuate disruption, this can be a particular adjustment reflecting Flash’s distinctive worth, nonetheless providing the very best cost-per-intelligence obtainable,” Google wrote in a weblog put up.
Google additionally launched a preview of Gemini 2.5 Flash-Lite, which has the bottom latency and value among the many 2.5 fashions. The corporate sees this as a cheap improve from 1.5 and a pair of.0 Flash, with higher efficiency throughout most evaluations, decrease time to first token, and better tokens per second decode.
Gemini 2.5 Flash-Lite additionally permits customers to regulate the pondering price range through an API parameter. For the reason that mannequin is designed for price and pace effectivity, pondering is turned off by default.
The brand new mannequin additionally helps Google’s native instruments together with Grounding with Google Search, Code Execution, URL Context, and performance calling.
The pricing for Gemini 2.5 Flash-Lite is $0.10/1 million enter tokens for textual content, picture, and video, $0.50/ 1 million enter tokens for audio, and $.40/1 million output tokens for all.