Back to AI Briefing
Google AI Blog
April 2, 2026
New ways to balance cost and reliability in the Gemini API
Quick Summary
"Google is introducing two new inference tiers to the Gemini API, Flex and Priority, to balance cost and latency."
This article was originally published by Google AI Blog. You can read the full, in-depth story at the source below.
Read Full Story at Google AI Blog