Requesty
OpenAI Inc.

gpt-5.4-mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale deployments. The model is designed for production environments that require a balance of capability and efficiency, making it well suited for chat applications, coding assistants, and agent workflows that operate at scale. GPT-5.4 mini delivers reliable instruction following, solid multi-step reasoning, and consistent performance across diverse tasks with improved cost efficiency.

πŸ‘Vision🧠ReasoningπŸ”§Tool calling⚑Caching

Pricing per 1M tokens

Input
$0.75
Output
$4.50
Cache write
$4.50
Cache read
$0.07

Specifications

Context window400K tokens
Max output128K tokens
API typechat
AddedMar 18, 2026
Model IDopenai/gpt-5.4-mini

Privacy & data

Data retentionYes (30 days)
Used for trainingNo
Provider locationπŸ‡ΊπŸ‡Έ US