News

Just weeks after unveiling Gemini 2.5 Pro, Google is on to its next top-performing model. On Thursday, the company released an "early version" of Gemini 2.5 Flash in preview in the Gemini API, AI ...
For prompts up to 200,000 tokens, Gemini 2.5 Pro costs $1.25 per million input tokens (a million tokens is roughly 750,000 words, longer than the entire “Lord of the Rings” series) and $10 per million output tokens.
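To make the quoted rates concrete, here is a minimal Python sketch of the arithmetic. The function and constant names are hypothetical, and it assumes simple linear per-token billing at the two quoted prices. It refuses prompts above 200,000 tokens because the snippet gives no rate for them.

```python
# Rates quoted above; they apply only to prompts of up to 200,000 tokens.
INPUT_USD_PER_MILLION = 1.25
OUTPUT_USD_PER_MILLION = 10.00
MAX_PROMPT_TOKENS = 200_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate one request's cost, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens > MAX_PROMPT_TOKENS:
        # The quoted prices do not cover longer prompts, so do not guess.
        raise ValueError("quoted rates only cover prompts up to 200,000 tokens")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # A 100,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(100_000, 2_000):.3f}")
```

At these rates, output tokens cost eight times as much as input tokens, so a long reply can outweigh a much longer prompt.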
In contrast to explicit caching, implicit caching is automatic. Enabled by default for Gemini 2.5 models, it passes on cost savings if a Gemini API request to a model hits a cache.
TL;DR Key Takeaways: Gemini AI’s new implicit caching feature reduces token costs by up to 75% for its 2.5 reasoning models by automatically applying discounts for repeated prompt prefixes.
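As a rough illustration of how a prefix-cache discount changes input cost, the sketch below bills cached prefix tokens at a reduced rate and the rest at the full rate. Everything here is an assumption made for illustration: the function name, the idea that the discount applies per cached token, the use of the full 75% (the text says "up to"), and the default rate, which borrows the $1.25 per million input tokens figure quoted for Gemini 2.5 Pro. It does not call the Gemini API.

```python
def input_cost_with_cache(
    prompt_tokens: int,
    cached_tokens: int,
    usd_per_million: float = 1.25,  # assumed: the Gemini 2.5 Pro input rate quoted earlier
    cache_discount: float = 0.75,   # assumed: the maximum discount mentioned in the text
) -> float:
    """Estimate input cost when part of the prompt prefix hits the cache."""
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    if not 0.0 <= cache_discount <= 1.0:
        raise ValueError("cache_discount must be between 0 and 1")
    uncached_tokens = prompt_tokens - cached_tokens
    # Cached tokens are billed at the discounted fraction of the full rate.
    billable_tokens = uncached_tokens + cached_tokens * (1.0 - cache_discount)
    return billable_tokens / 1_000_000 * usd_per_million


if __name__ == "__main__":
    no_hit = input_cost_with_cache(100_000, 0)
    hit = input_cost_with_cache(100_000, 80_000)
    print(f"no cache hit: ${no_hit:.3f}, 80% prefix hit: ${hit:.3f}")
```

Under these assumptions the saving depends on how much of the prompt repeats an earlier prefix, so requests that share a long, stable opening benefit most.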