Google LLC (Vertex AI)
gemini-2.5-flash
Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.
πVisionπ§ Reasoningπ§Tool callingβ‘Caching
Pricing per 1M tokens
Input
$0.30
Output
$2.50
Cache write
$0.55
Cache read
$0.07
Specifications
Context window1.0M tokens
Max output66K tokens
API typechat
AddedMay 20, 2025
Model IDvertex/gemini-2.5-flash
Privacy & data
Data retentionNo
Used for trainingNo
Provider locationπΊπΈ US / πͺπΊ EU
Privacy policyVertex AI Data Governance β
