Google LLC (Vertex AI)
google/gemini-2.5-flash (us-west1)
Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.
Vision Support
Advanced Reasoning
Caching Support
Pricing
$0.3
Input tokens per million
$2.50
Output tokens per million
Caching Pricing
$0.55
Cache write per million
$0.07
Cache read per million
Technical Specifications
Context Window
1.0M tokens
Max Output Tokens
66K tokens
Global Availability
Last Updated
N/A