OpenAI Responses
OpenAI models optimized for structured response formats and agentic coding tasks.
All OpenAI Responses Models
o3-pro
The o3 series of models is trained with reinforcement learning to perform complex reasoning. o1 models think before they answer, producing a long internal chain of thought before responding to the user. The o1 reasoning model is designed to solve hard problems across domains. The knowledge cutoff for o1 and o1-mini models is October 2023.
gpt-5-pro
GPT-5 Pro is OpenAI's extended-reasoning tier of GPT-5, built to push reliability on hard problems, long tool chains, and agentic workflows. It keeps GPT-5's multimodal skills and very large context (the API page lists up to 400K tokens) while allocating more compute to think longer and plan better, improving code generation, math, and complex writing beyond standard GPT-5/"Thinking." OpenAI positions Pro as the version that "uses extended reasoning for even more comprehensive and accurate answers," targeting high-stakes tasks and enterprise use.
gpt-4.1-mini
GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on MultiChallenge, and 84.1% on IFEval. Mini also shows strong coding ability (e.g., 31.6% on Aider's polyglot diff benchmark) and vision understanding, making it suitable for interactive applications with tight performance constraints.
gpt-4.1-nano
For tasks that demand low latency, GPT-4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, even higher than GPT-4o mini. It's ideal for tasks like classification or autocompletion.
gpt-5.1-codex
GPT-5.1-Codex is a version of GPT-5 optimized for agentic coding tasks in Codex or similar environments.
gpt-5-nano
GPT-5 nano is OpenAI's fastest, cheapest version of GPT-5. It's great for summarization and classification tasks.
o3-mini
o3-mini is OpenAI's most recent small reasoning model, providing high intelligence at the same cost and latency targets of o1-mini. o3-mini also supports key developer features, like Structured Outputs, function calling, Batch API, and more. Like other models in the o-series, it is designed to excel at science, math, and coding tasks.
gpt-5-mini
GPT-5 mini is a faster, more cost-efficient version of GPT-5. It's great for well-defined tasks and precise prompts.
gpt-5
GPT-5 is OpenAI's flagship model for coding, reasoning, and agentic tasks across domains.
gpt-4.1
GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.
gpt-5-codex
GPT-5-Codex is a version of GPT-5 optimized for agentic coding tasks in Codex or similar environments.
o4-mini
o3-mini is OpenAI's most recent small reasoning model, providing high intelligence at the same cost and latency targets of o1-mini. o3-mini also supports key developer features, like Structured Outputs, function calling, Batch API, and more. Like other models in the o-series, it is designed to excel at science, math, and coding tasks.
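Several entries above quote a context window, which matters when deciding whether a long prompt can be sent to a given model. The following is a minimal sketch of such a check, not part of any official SDK. The table only contains the figures quoted in this listing, and the names `CONTEXT_WINDOW_TOKENS`, `fits_context`, and `reserved_output_tokens` are hypothetical. It also assumes that reserved output tokens count against the same window, which may not hold for every model.

```python
# Context windows (in tokens) as quoted in the model descriptions above.
# Models whose descriptions give no figure are deliberately left out.
CONTEXT_WINDOW_TOKENS = {
    "gpt-4.1": 1_000_000,
    "gpt-4.1-mini": 1_000_000,
    "gpt-4.1-nano": 1_000_000,
    "gpt-5-pro": 400_000,  # listed as "up to 400K tokens"
}


def fits_context(model: str, prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """Return True if the prompt plus reserved output fits the quoted window.

    Assumption: output tokens share the window with the prompt.
    Raises KeyError for models with no quoted window rather than guessing.
    """
    window = CONTEXT_WINDOW_TOKENS[model]
    return prompt_tokens + reserved_output_tokens <= window
```

The token counts are taken as inputs because this listing does not specify a tokenizer; in practice they would come from whichever tokenizer matches the chosen model.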
Ready to use OpenAI Responses models?
Access all OpenAI Responses models through Requesty's unified API with intelligent routing, caching, and cost optimization.
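As a rough illustration of what a request to one of these models might look like, the sketch below builds (but does not send) an HTTP request whose body uses the `model` and `input` fields of OpenAI's Responses API. The base URL is a placeholder and the `/responses` path on a unified gateway is an assumption; the real endpoint, path, and authentication scheme should be taken from the provider's documentation. The helper name `build_responses_request` is hypothetical.

```python
import json
import urllib.request

# Placeholder only: substitute the base URL from your provider's documentation.
BASE_URL = "https://example.invalid/v1"


def build_responses_request(model: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build a POST request with a Responses-style JSON body (not sent here)."""
    body = json.dumps({"model": model, "input": prompt}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/responses",  # assumed path, mirroring OpenAI's Responses API
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed bearer-token auth
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it once `BASE_URL` points at a real endpoint. Switching models is then a matter of changing the `model` string to one of the identifiers listed above, for example `gpt-5-mini` for a well-defined classification prompt.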