Google LLC (Vertex AI)

Google Cloud's enterprise AI platform with comprehensive MLOps.

πŸ“ πŸ‡ΊπŸ‡Έ US / πŸ‡ͺπŸ‡Ί EUβ€’78 models availableβ€’Visit Website β†’
78
Available Models
$1.99
Avg Input Price/M
$0.3
Cheapest Model
vertex/gemini-2.5-flash-image-preview@europe-west4
$3.00
Most Expensive
vertex/anthropic/claude-3-7-sonnet@europe-west1

Features Overview

78
Vision Support
66
Advanced Reasoning
78
Caching Support
56
Computer Use

Privacy & Data Policy

Data Retention

No data retention

Location

πŸ‡ΊπŸ‡Έ US / πŸ‡ͺπŸ‡Ί EU

All Google LLC (Vertex AI) Models

View All Providers β†’
Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-4-opus

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-3-7-sonnet (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-4-1-opus (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Google LLC (Vertex AI)

claude-3-7-sonnet

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

google/gemini-2.5-pro

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-4-1-opus

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Google LLC (Vertex AI)

claude-4-sonnet

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

google/gemini-2.5-flash

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-4-opus (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Google LLC (Vertex AI)

google/gemini-2.5-pro (us-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-4-sonnet (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-4-sonnet (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

anthropic/claude-4-opus

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Google LLC (Vertex AI)

anthropic/claude-3-7-sonnet

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

anthropic/claude-4-sonnet

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

anthropic/claude-4-sonnet-latest

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

google/gemini-2.5-pro (us-east1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-4-1-opus (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-4-opus (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

google/gemini-2.5-pro (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-3-5-sonnet (europe-west1)

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Google LLC (Vertex AI)

claude-3-5-sonnet (us-east5)

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Google LLC (Vertex AI)

claude-3-5-sonnet

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Google LLC (Vertex AI)

gemini-2.5-flash-image-preview

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Google LLC (Vertex AI)

google/gemini-2.5-pro (us-south1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs β€œthinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

anthropic/claude-3-5-sonnet

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excells in coding.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model which supports a 1M token context window and has thinking budgets. Most balanced Gemini model, optimized for low latency use cases.

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-3-7-sonnet (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

anthropic/claude-4-1-opus

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Ready to use Google LLC (Vertex AI) models?

Access all Google LLC (Vertex AI) models through Requesty's unified API with intelligent routing, caching, and cost optimization.

Google LLC (Vertex AI) AI Models - Pricing & Features | Requesty | Requesty