Requesty just became the most affordable place to run Googleâs latest Gemini models. Thanks to the optimisations we rolled out for our new enforced-caching layer, weâre slashing prices by a full 50 % on both Gemini 2.5 Pro and Gemini 2.5 Flashâeffective immediately.
Whatâs New?
1. Half-Price Gemini Models
Model | Input (old â new ) | Output (old â new ) |
Gemini 2.5 Pro
| $1.25 â $0.62 per M tokens | $10.00 â $5.00 per M tokens |
Gemini 2.5 Flash
| $0.15 â $0.075 per M tokens | $3.50 â $1.75 per M tokens |
Thatâs the best public pricing youâll findâbeating OpenRouter by a mile.
2. Smarter, Cheaper Caching
Our one-hour prompt cache now covers Gemini calls too. Repeat inputs in agent loops hit the cache at just 25 % of the already-discounted input rate, so complex chains run faster and cheaper.
3. Plug-and-Play Tooling (No Code Changes)
Point your favourite environment at the Requesty endpoint, pick coding/gemini-2.5-pro or coding/gemini-2.5-flash, and youâre off. Works everywhere you do:
Cline â terminal-first power with âextended reasoningâ toggle
OpenWebUI
Kilo Code
Roo Code â inline multi-file refactors on the cheap
Aider & Continue â persistent sessions, now at half the cost
âŠand any custom stack that speaks HTTP.
Why Choose Requesty for Gemini 2.5?
Requesty Advantage | What it Means for You |
Enforced Prompt Caching | Up to 75 % lower spend on iterative prompts |
Unified Billing | One invoice for Google, Anthropic, OpenAI & more |
Parallel Tool Execution | Web search, Python sandbox & file ops in a single call |
Rock-Solid Routing | Automatic retries & multi-zone failover keep your app live |
Transparent Analytics | Real-time dashboards for tokens, cache hits & latency |
Get Started in 3 Minutes
Sign in / Create your Requesty account (new users still get $6 free credits).
Enable Gemini 2.5: Navigate to Models â Google and flip on
gemini-2.5-proorgemini-2.5-flash.Toggle Enforced Caching in Project Settings to unlock the biggest savings.
Update Your Tool Config:
jsoncCopyEdit// cline / roo / etc. { "endpoint": "https://router.requesty.ai/v1", "model": "coding/gemini-2.5-pro" // or gemini-2.5-flash }
Thatâs itâship your next feature, run larger evals, or spin up agentic workflows for half the usual cost.
Limited-Time Pricing, Unlimited Possibilities
Whether youâre building a lightning-fast chat assistant or a multi-hour autonomous coder, Gemini 2.5 on Requesty now offers frontier performance at bargain-bin prices. Dive in, break things (cheaply), and tell us what else youâd like to see.