Google LLC (Vertex AI)

google/gemini-2.5-flash (us-west1)

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision Support

Advanced Reasoning

Caching Support

Pricing

$0.3

Input tokens per million

$2.50

Output tokens per million

Caching Pricing

$0.55

Cache write per million

$0.07

Cache read per million

Technical Specifications

Context Window

1.0M tokens

Max Output Tokens

66K tokens

Global Availability

Last Updated

N/A

Provider

Google LLC (Vertex AI)

Location

🇺🇸 US / 🇪🇺 EU

Visit Website →

Privacy & Data

Data Retention

Used for Training

Vertex AI Data Governance →

Get Started

Try with Requesty Browse All Google LLC (Vertex AI) Models →