Google AI Developers announces two new Gemini API inference tiers: Flex, which costs 50 percent less for cost-sensitive, latency-tolerant workloads, and Priority, which offers the highest reliability at premium pricing. The tiers signal more granular cost-reliability tradeoffs.
Balance cost & reliability with our new Flex & Priority inference tiers in the Gemini API!
Flex: Pay 50% less for cost-sensitive & latency-tolerant workloads
Priority: Highest reliability for your most critical, interactive apps (with premium pricing)
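The tradeoff described in the post can be sketched as a small routing and cost estimate. This is only an illustration, not the Gemini API: the `Workload` type, the `choose_tier` and `estimated_cost` helpers, the "standard" fallback tier, and the dollar figures are all hypothetical. The only number taken from the announcement is the 50% Flex discount. The Priority premium is not stated, so it is left as a caller-supplied parameter.

```python
from dataclasses import dataclass
from typing import Optional

# From the announcement: Flex costs 50% less.
FLEX_DISCOUNT = 0.5


@dataclass
class Workload:
    """Hypothetical description of a workload, for illustration only."""
    name: str
    latency_tolerant: bool
    critical_interactive: bool
    standard_cost_usd: float  # assumed baseline cost, not a real price


def choose_tier(w: Workload) -> str:
    """Pick a tier label following the post's guidance.

    "standard" is an assumed fallback name for workloads that fit neither tier.
    """
    if w.critical_interactive:
        return "priority"
    if w.latency_tolerant:
        return "flex"
    return "standard"


def estimated_cost(w: Workload, priority_multiplier: Optional[float] = None) -> Optional[float]:
    """Estimate cost for the chosen tier.

    Returns None for Priority when no multiplier is supplied, because the
    announcement does not state the size of the premium.
    """
    tier = choose_tier(w)
    if tier == "flex":
        return w.standard_cost_usd * (1 - FLEX_DISCOUNT)
    if tier == "priority":
        if priority_multiplier is None:
            return None
        return w.standard_cost_usd * priority_multiplier
    return w.standard_cost_usd


if __name__ == "__main__":
    batch = Workload("nightly-batch", latency_tolerant=True,
                     critical_interactive=False, standard_cost_usd=100.0)
    chat = Workload("support-chat", latency_tolerant=False,
                    critical_interactive=True, standard_cost_usd=100.0)
    print(choose_tier(batch), estimated_cost(batch))
    print(choose_tier(chat), estimated_cost(chat))
```

With a baseline of 100.0, the latency-tolerant batch job lands on Flex at half the cost, while the interactive workload lands on Priority with its cost left undetermined until a premium is known.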