
Google LLC (Vertex AI)
Google Cloud's enterprise AI platform with comprehensive MLOps. Requesty routes to 66 Google LLC (Vertex AI) models starting at $0.10 per 1M input tokens with context windows up to 1.0M tokens. One API key, OpenAI-compatible SDK, no markup.
Flagship model
kimi-k2MMLU Pro
82.3%
GPQA
70.0%
HumanEval
89.9%
SWE-Bench
65.8%
All Google LLC (Vertex AI) models
| Model | Context | Max Output | Input/1M | Output/1M | Capabilities | SWE-Bench |
|---|---|---|---|---|---|---|
kimi-k2 | 262K | 262K | $0.60 | $2.50 | ππ§ π§β‘ | 66% |
deepseek-v3.2 | 164K | 66K | $0.56 | $1.68 | ππ§ π§β‘ | β |
gemini-3.1-flash-lite-preview | 1.0M | 66K | $0.25 | $1.50 | ππ§β‘ | β |
gemini-3.1-flash-image-preview | 131K | 33K | $0.50 | $2.00 | ππ§ π§β‘ | β |
gemini-3-pro-preview | 1.0M | 66K | $2.00 | $12.00 | ππ§ π§β‘ | β |
gemini-3.1-pro-preview | 1.0M | 66K | $2.00 | $12.00 | ππ§ π§β‘ | β |
claude-opus-4-6 @us-east5 | 1M | 128K | $5.50 | $27.50 | ππ§ π§β‘π₯ | 75% |
claude-opus-4-6 | 1M | 128K | $5.00 | $25.00 | ππ§ π§β‘π₯ | 75% |
claude-opus-4-6 @europe-west1 | 1M | 128K | $5.50 | $27.50 | ππ§ π§β‘π₯ | 75% |
gemini-3-flash-preview | 1.0M | 66K | $0.50 | $3.00 | ππ§ π§β‘ | β |
claude-opus-4-5 | 200K | 64K | $5.00 | $25.00 | ππ§ π§β‘π₯ | 71% |
claude-opus-4-5 @europe-west1 | 200K | 64K | $5.50 | $27.50 | ππ§ π§β‘π₯ | 71% |
claude-opus-4-5 @us-east5 | 200K | 64K | $5.50 | $27.50 | ππ§ π§β‘π₯ | 71% |
gemini-2.5-flash-image | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | β |
claude-opus-4-1 | 200K | 32K | $15.00 | $75.00 | ππ§ π§β‘π₯ | β |
claude-opus-4 @us-east5 | 200K | 32K | $15.00 | $75.00 | ππ§ π§β‘π₯ | β |
claude-haiku-4-5 @europe-west1 | 200K | 64K | $1.10 | $5.50 | ππ§β‘π₯ | 54% |
claude-haiku-4-5 @us-east5 | 200K | 64K | $1.10 | $5.50 | ππ§β‘π₯ | 54% |
claude-haiku-4-5 | 200K | 64K | $1.00 | $5.00 | ππ§β‘π₯ | 54% |
claude-sonnet-4-5 @europe-west1 | 200K | 64K | $3.30 | $16.50 | ππ§ π§β‘π₯ | 71% |
claude-sonnet-4 @us-east5 | 200K | 64K | $3.00 | $15.00 | ππ§ π§β‘π₯ | 65% |
claude-sonnet-4-5 @us-east5 | 200K | 64K | $3.30 | $16.50 | ππ§ π§β‘π₯ | 71% |
claude-opus-4 | 200K | 32K | $15.00 | $75.00 | ππ§ π§β‘π₯ | β |
claude-opus-4-1 @us-east5 | 200K | 32K | $15.00 | $75.00 | ππ§ π§β‘π₯ | β |
claude-sonnet-4-5 | 200K | 64K | $3.00 | $15.00 | ππ§ π§β‘π₯ | 71% |
claude-opus-4-1 @europe-west1 | 200K | 32K | $15.00 | $75.00 | ππ§ π§β‘π₯ | β |
claude-sonnet-4 @europe-west1 | 200K | 64K | $3.00 | $15.00 | ππ§ π§β‘π₯ | 65% |
claude-sonnet-4 | 200K | 64K | $3.00 | $15.00 | ππ§ π§β‘π₯ | 65% |
claude-opus-4 @europe-west1 | 200K | 32K | $15.00 | $75.00 | ππ§ π§β‘π₯ | β |
gemini-2.5-flash-lite @europe-central2 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash-lite @europe-west4 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash-lite | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash @us-east1 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash @us-west1 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash @europe-central2 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash @europe-west4 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash @europe-north1 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash @europe-west1 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash-lite @europe-north1 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash-lite @europe-west1 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash @europe-west8 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash-lite @us-central1 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash-lite @us-east1 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash-lite @us-east5 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash-lite @us-south1 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash @us-east5 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash-lite @us-west1 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-2.5-flash @us-central1 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash @us-south1 | 1.0M | 66K | $0.30 | $2.50 | ππ§ π§β‘ | 53% |
gemini-2.5-flash-lite @europe-west8 | 1.0M | 66K | $0.10 | $0.40 | ππ§ π§β‘ | β |
gemini-3-pro-image-preview | 1.0M | 33K | $2.00 | $12.00 | ππ§ π§β‘ | β |
gemini-2.5-pro @europe-central2 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @us-east5 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @europe-west4 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @europe-north1 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @europe-west8 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @us-west1 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @europe-west1 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @us-central1 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @us-south1 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
gemini-2.5-pro @us-east1 | 1.0M | 66K | $1.25 | $10.00 | ππ§ π§β‘π₯ | 64% |
claude-3-7-sonnet @europe-west1 | 200K | 64K | $3.00 | $15.00 | ππ§ π§β‘π₯ | 62% |
claude-3-7-sonnet | 200K | 64K | $3.00 | $15.00 | ππ§ π§β‘π₯ | 62% |
claude-3-7-sonnet @us-east5 | 200K | 64K | $3.00 | $15.00 | ππ§ π§β‘π₯ | 62% |
About Google LLC (Vertex AI) on Requesty
How many Google LLC (Vertex AI) models are available through Requesty?
Requesty routes to 66 Google LLC (Vertex AI) models including regional variants, with pricing synced in real time to the upstream provider.
What is the cheapest Google LLC (Vertex AI) model?
The cheapest Google LLC (Vertex AI) model starts at $0.10 per million input tokens. See the pricing column in the table below for full per-model rates.
Does Requesty add markup on Google LLC (Vertex AI) pricing?
No. Requesty passes through exactly what Google LLC (Vertex AI) charges. You pay the same per-token rates as going direct β plus you get smart routing, caching, analytics, and one unified API for 400+ models.
Is my data used to train Google LLC (Vertex AI) models?
Google LLC (Vertex AI)'s terms state that API data is not used for training. See their privacy policy for the authoritative statement.
Where are Google LLC (Vertex AI) models hosted?
Google LLC (Vertex AI) models are hosted in πΊπΈ US / πͺπΊ EU. Some models are available in additional regions through AWS Bedrock, Azure, or Google Vertex AI β filter by region on the Google LLC (Vertex AI) rows in the models explorer.
