Industry Market And Company MovesMarket Move

Gemini API Flex and Priority inference tiers

April 2, 2026Google AI Developers

Google AI Developers announces new Gemini API inference tiers, offering Flex for 50 percent lower cost with latency tolerance and Priority for higher reliability at premium pricing, signaling more granular cost reliability tradeoffs.

Balance cost & reliability with our new Flex & Priority inference tiers in the Gemini API!
Flex: Pay 50% less for cost-sensitive & latency-tolerant workloads
Priority: Highest reliability for your most critical, interactive apps (with premium pricing)
Google AI Developers
apipricingreliabilitygoogleapigemini

See what experts are saying right now

This finding is one of many signals tracked across Artificial Intelligence. The live feed updates every few hours with new expert voices, debates, and emerging ideas.

← Back to Artificial Intelligence