Unifying Speed, Depth, and Tool Access
Weâre excited to bring you three new, gameâchanging models on Requestyâthe alternative to Openrouter, LiteLLM and other routing platforms:
GPTâ4.1: Balanced speed + extended reasoning in one LLM.
OpenAI o4âmini: A costâefficient, âminiâ model that still excels at math, coding, and more.
OpenAI o3: Our advanced reasoning powerhouse for multi-step tasks, image analysis, and agentic workflows.
With Requesty, these models integrate seamlessly into the coding and chat tools you already useâRoo Code, Cline, OpenWebUI, and moreâhelping you solve bigger problems, faster.
Highlights & Benchmarks
1. Unified AI Reasoning
GPTâ4.1 merges rapid Q&A with deeper, stepâbyâstep thinkingâno need to pick separate âfastâ or âreflectiveâ variants. The same goes for o3 and o4âmini, each offering:
Tool Integration: They can search the web, run Python code, manipulate images, or orchestrate multi-step logic.
Extended vs. Instant: Configure a âthinking tokensâ limit if you want to force quicker or more inâdepth reasoning.
2. Coding & Developer Performance
All three models show major improvements in real coding tasksâfrom multiâfile refactoring to advanced debugging. They handle code diffs more reliably, and can integrate with dev tools:
Roo Code: For inline coding suggestions, multiâfile refactoring, advanced debugging, and minimal extraneous edits.
Cline: Quick, powerful dev and terminal integration for short or extended coding tasks.
OpenWebUI: A user-friendly chat environment that seamlessly pairs your conversation with tool usageâjust pick the model in settings.
Benchmarks highlight these capabilities:
SWE-Bench Verified
GPTâ4.1 / o3 surpass older GPT or oâseries models, with 60â70% pass rates in real software engineering tasks.
o4âmini still outperforms older âminiâ models at significantly lower cost.
Aider Polyglot (Code Editing)
GPTâ4.1 / o3 approach an ~80% success rate for multiâlanguage editing in diff formats.
o4âmini offers ~60â70% success at minimal expense.
3. Visual & Multimodal Mastery
GPTâ4.1 and o3 interpret images natively, solving tasks that combine textual + visual reasoning. They can zoom, rotate, or transform images in the context of an agentic workflow. Evaluations like MMMU (college-level visual tasks) or MathVista (visual math) show:
o3 near stateâofâtheâart for advanced figure reasoning.
o4âmini remains strong in baseline visual tasks.
4. Agentic Tool Use
All three models:
Learn to reason about when and how to call external tools (search, code, or image transformations).
Switch strategies midâconversationâlike reâsearching if new info is found.
Reliably produce the final answer in a requested format, typically under a minute.
5. Verified Safety & Reliability
Each model has updated refusal prompts, better injection resistance, and advanced filters for harmful queries. We extensively tested them in line with Requestyâs safety protocols, ensuring consistent guardrailsâwhile reducing needless refusals for normal user prompts.
Pricing on Requesty
Weâre keeping it simple: pay for tokens used (and an additional fee per tool call, if any). Below are prices per 1 million tokens:
GPTâ4.1
Input: $2.00
Cached Input: $0.50
Output: $8.00
OpenAI o4âmini
Input: $1.10
Cached Input: $0.275
Output: $4.40
OpenAI o3
Input: $10.00
Cached Input: $2.50
Output: $40.00
(Tool callsâe.g. web search, code executionâcost extra per call; see Requesty docs for details.)
Integrating with Your Favorite Tools
Roo Code
In the advanced settings, pick your Requesty endpoint and choose the model.
Enjoy multiâfile editing, advanced debugging, and synergy with your local dev environment.
Cline
In the advanced settings, pick your Requesty endpoint and choose the model.
Toggle âextended reasoningâ if you want GPTâ4.1 or o3 to think more thoroughly before responding.
OpenWebUI
Open your Providers panel, set Requestyâs base URL, and pick your new model.
Watch them handle free-flow chat and tool usage on demand.
(Prefer a different interface? Our partners include Aider, Goose, Crew AI, and many moreâjust select the new Requesty models in their respective settings.)
Why Choose Requesty?
Alternative to OpenRouter We offer a straightforward developer experience, unified billing, and special features like advanced caching or custom tool combos.
Tooling Ecosystem Our platform is designed to host multiple tool endpoints for each modelâscript your entire agentic workflow in one place.
Transparent Pricing We highlight input vs. cached input vs. output tokens, so you always know what youâll pay. Tools are payâperâcall, letting you scale based on your usage.
Flexibility + Power From quick Q&A with o4âmini to inâdepth multimodal tasks with o3, or the perfect mid-ground with GPTâ4.1, you can always find the right model for the job.
Ready to Explore?
Get your entire app pipeline running on GPTâ4.1, o4âmini, or o3 with tool integrationâand watch how swiftly you can code, create, or solve complex problems.
Thank you for choosing Requesty, your ultimate alternative to other routing platforms. We canât wait to see what you build with these new models!