Google LLC (Gemini API)

Google's advanced AI model with multimodal capabilities.

πŸ“ 🌍 Globalβ€’10 models availableβ€’Visit Website β†’
10
Available Models
$0.45
Avg Input Price/M
$0.04
Cheapest Model
google/gemini-1.5-flash-8b-latest
$1.25
Most Expensive
google/gemini-1.5-pro-latest

Features Overview

4
Vision Support
3
Advanced Reasoning
3
Caching Support
0
Computer Use

Privacy & Data Policy

Data Retention

Yes

Location

🌍 Global

All Google LLC (Gemini API) Models

View All Providers β†’
Google LLC (Gemini API)

gemini-1.5-flash-8b-latest

Context Window
1.0M tokens
Max Output
8K tokens
Input
$0.04/M tokens
Output
$0.15/M tokens
Google LLC (Gemini API)

gemini-1.5-pro-latest

Context Window
2.1M tokens
Max Output
8K tokens
Input
$1.25/M tokens
Output
$5.00/M tokens
Google LLC (Gemini API)

gemini-2.5-pro

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Gemini API)

gemini-2.5-flash

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Google LLC (Gemini API)

gemini-1.5-flash-latest

Context Window
1.0M tokens
Max Output
8K tokens
Input
$0.07/M tokens
Output
$0.3/M tokens
Google LLC (Gemini API)

gemini-1.5-pro

Context Window
2.1M tokens
Max Output
8K tokens
Input
$1.25/M tokens
Output
$5.00/M tokens
Google LLC (Gemini API)

gemini-1.5-flash

Context Window
1.0M tokens
Max Output
8K tokens
Input
$0.07/M tokens
Output
$0.3/M tokens
Google LLC (Gemini API)

gemini-1.5-flash-8b

Context Window
1.0M tokens
Max Output
8K tokens
Input
$0.07/M tokens
Output
$0.3/M tokens
Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost effective model, built for at scale usage.

Google LLC (Gemini API)

gemini-2.0-flash-001

Vision
Context Window
1.0M tokens
Max Output
8K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to [Gemini Flash 1.5](/google/gemini-flash-1.5), while maintaining quality on par with larger models like [Gemini Pro 1.5](/google/gemini-pro-1.5). It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.

Ready to use Google LLC (Gemini API) models?

Access all Google LLC (Gemini API) models through Requesty's unified API with intelligent routing, caching, and cost optimization.

Google LLC (Gemini API) AI Models - Pricing & Features | Requesty