Google LLC (Vertex AI)

Google Cloud's enterprise AI platform with comprehensive MLOps.

📍 🇺🇸 US / 🇪🇺 EU • 60 models available
Available Models: 60
Avg Input Price/M: $2.77
Cheapest Model: vertex/gemini-2.5-flash-lite@europe-north1 ($0.1/M input tokens)
Most Expensive: vertex/claude-opus-4-1@us-east5 ($15.00/M input tokens)
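
The two identifiers in the summary above appear to follow a `provider/model@region` pattern. The sketch below splits such a string into its parts. The pattern is inferred from those two examples only, and treating the `@region` suffix as optional is an assumption based on the cards that list no region. `ModelRef` and `parse_model_ref` are hypothetical names, not part of any documented API.

```python
from typing import NamedTuple, Optional


class ModelRef(NamedTuple):
    provider: str
    model: str
    region: Optional[str]  # None when no "@region" suffix is present


def parse_model_ref(ref: str) -> ModelRef:
    """Split 'provider/model[@region]' into parts (format is an assumption)."""
    provider, sep, rest = ref.partition("/")
    if not sep or not provider or not rest:
        raise ValueError(f"expected 'provider/model[@region]', got {ref!r}")
    # Everything after the first "@" is treated as the region.
    model, _, region = rest.partition("@")
    return ModelRef(provider, model, region or None)


print(parse_model_ref("vertex/gemini-2.5-flash-lite@europe-north1"))
```

Keeping the region as a separate field makes it straightforward to group the regional variants of one model, which is how most of this listing is organized.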

Features Overview

Vision Support: 60
Advanced Reasoning: 54
Caching Support: 60
Computer Use: 35

Privacy & Data Policy

Data Retention

No data retention

Location

🇺🇸 US / 🇪🇺 EU

All Google LLC (Vertex AI) Models

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-flash (us-east5)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

gemini-2.5-flash-lite (us-east1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-pro (us-central1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

gemini-2.5-pro (us-east1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-sonnet-4-5

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-flash-lite (us-west1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

claude-3-7-sonnet (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

gemini-2.5-flash-lite

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

claude-opus-4-5

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$5.00/M tokens
Output
$25.00/M tokens

Google LLC (Vertex AI)

gemini-2.5-pro (europe-west4)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-opus-4-5 (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$5.00/M tokens
Output
$25.00/M tokens

Google LLC (Vertex AI)

claude-3-5-sonnet (europe-west1)

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excels in coding.

Google LLC (Vertex AI)

claude-haiku-4-5

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$1.00/M tokens
Output
$5.00/M tokens

Anthropic Haiku 4.5

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-pro

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

gemini-2.5-pro (europe-west8)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

gemini-2.5-flash (us-south1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

gemini-2.5-flash (europe-west4)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

gemini-2.5-pro (europe-north1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-sonnet-4-5 (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

gemini-3-pro-preview

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$2.00/M tokens
Output
$12.00/M tokens

Gemini 3 Pro is designed to tackle the most challenging agentic problems with strong coding and state-of-the-art reasoning capabilities. It is the best model for complex multimodal understanding. Compared to Gemini 2.5 Pro, it improves significantly on complex instruction following and delivers outcomes with better output efficiency.

Google LLC (Vertex AI)

gemini-2.5-flash-image-preview

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$30.00/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

claude-3-5-sonnet

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excels in coding.

Google LLC (Vertex AI)

claude-sonnet-4

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

gemini-3-pro-image-preview

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
33K tokens
Input
$2.00/M tokens
Output
$12.00/M tokens

Gemini 3 Pro Image, or Gemini 3 Pro (with Nano Banana), is designed to tackle the most challenging image generation by incorporating state-of-the-art reasoning capabilities. It's the best model for complex and multi-turn image generation and editing, having improved accuracy and enhanced image quality.

Google LLC (Vertex AI)

claude-3-7-sonnet (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

gemini-2.5-flash-lite (us-east5)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-pro (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

gemini-2.5-flash (europe-west8)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

claude-opus-4-1 (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$15.00/M tokens
Output
$75.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-haiku-4-5 (europe-west1)

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$1.00/M tokens
Output
$5.00/M tokens

Anthropic Haiku 4.5

Google LLC (Vertex AI)

claude-sonnet-4-5 (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-opus-4

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$15.00/M tokens
Output
$75.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

gemini-2.5-flash

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

claude-opus-4 (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$15.00/M tokens
Output
$75.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

gemini-2.5-pro (europe-central2)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

gemini-2.5-pro (us-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

gemini-2.5-flash (us-east1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

gemini-2.5-flash (europe-north1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

claude-opus-4-1 (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$15.00/M tokens
Output
$75.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-3-5-sonnet (us-east5)

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's previous most intelligent model. High level of intelligence and capability. Excels in coding.

Google LLC (Vertex AI)

gemini-2.5-flash-lite (us-south1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-flash (europe-west1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

claude-sonnet-4 (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

gemini-2.5-flash (us-central1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Google LLC (Vertex AI)

gemini-2.5-flash (us-west1)

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.3/M tokens
Output
$2.50/M tokens

Google's first hybrid reasoning model, which supports a 1M-token context window and thinking budgets. The most balanced Gemini model, optimized for low-latency use cases.

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-pro (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-opus-4 (us-east5)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$15.00/M tokens
Output
$75.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-opus-4-5 (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$5.00/M tokens
Output
$25.00/M tokens

Google LLC (Vertex AI)

claude-opus-4-1

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$15.00/M tokens
Output
$75.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-haiku-4-5 (us-east5)

Vision
Caching
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$1.00/M tokens
Output
$5.00/M tokens

Anthropic Haiku 4.5

Vision
Caching
Reasoning
Context Window
1.0M tokens
Max Output
66K tokens
Input
$0.1/M tokens
Output
$0.4/M tokens

Google's smallest and most cost-effective model, built for at-scale usage.

Google LLC (Vertex AI)

gemini-2.5-pro (us-south1)

Vision
Caching
Reasoning
Computer Use
Context Window
1.0M tokens
Max Output
66K tokens
Input
$1.25/M tokens
Output
$10.00/M tokens

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy and nuanced context handling. Gemini 2.5 Pro achieves top-tier performance on multiple benchmarks, including first-place positioning on the LMArena leaderboard, reflecting superior human-preference alignment and complex problem-solving abilities.

Google LLC (Vertex AI)

claude-sonnet-4 (europe-west1)

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.

Google LLC (Vertex AI)

claude-3-7-sonnet

Vision
Caching
Reasoning
Computer Use
Context Window
200K tokens
Max Output
64K tokens
Input
$3.00/M tokens
Output
$15.00/M tokens

Anthropic's most intelligent model. The first hybrid reasoning model on the market with the highest level of intelligence and capability with toggleable extended thinking. Top-tier results in reasoning, coding, multilingual tasks, long-context handling, honesty, and image processing.
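
Because every card quotes input and output prices per million tokens, the cost of a single request can be estimated with simple arithmetic. The sketch below copies a handful of prices from the cards above. `PRICES` and `estimate_cost` are hypothetical names used only for illustration, and the calculation ignores any caching discounts or tiered pricing that may apply in practice.

```python
# USD per million tokens as (input, output), copied from the cards above.
PRICES = {
    "gemini-2.5-flash-lite": (0.10, 0.40),
    "gemini-2.5-flash": (0.30, 2.50),
    "gemini-2.5-pro": (1.25, 10.00),
    "claude-haiku-4-5": (1.00, 5.00),
    "claude-sonnet-4-5": (3.00, 15.00),
    "claude-opus-4-1": (15.00, 75.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from list prices only."""
    input_price, output_price = PRICES[model]
    # Prices are quoted per million tokens, so scale the total by 1e6.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    for name in PRICES:
        cost = estimate_cost(name, input_tokens=20_000, output_tokens=2_000)
        print(f"{name}: ${cost:.4f}")
```

For the same 20,000-token prompt and 2,000-token reply, the spread between the cheapest and most expensive entries is two orders of magnitude, which is why output-heavy workloads are worth pricing out before picking a model.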

Ready to use Google LLC (Vertex AI) models?

Access all Google LLC (Vertex AI) models through Requesty's unified API with intelligent routing, caching, and cost optimization.
