Requesty
OpenAI Responses

gpt-5.4-nano

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency use cases such as classification, data extraction, ranking, and sub-agent execution. The model prioritizes responsiveness and efficiency over deep reasoning, making it ideal for pipelines that require fast, reliable outputs at scale. GPT-5.4 nano is well suited for background tasks, real-time systems, and distributed agent architectures where minimizing cost and latency is essential.

👁 Vision · 🧠 Reasoning · 🔧 Tool calling · ⚡ Caching

Pricing per 1M tokens

Input: $0.20
Output: $1.25
Cache write: $1.25
Cache read: $0.02
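
The per-token arithmetic behind these rates can be sketched as follows. This is a minimal illustration rather than an official billing formula: the function name `estimate_cost` and the example workload are hypothetical, and the sketch assumes each token category is billed independently at its listed rate, which the listing itself does not confirm.

```python
# Prices in USD per 1M tokens, copied from the listing above.
PRICES_PER_MILLION = {
    "input": 0.20,
    "output": 1.25,
    "cache_write": 1.25,
    "cache_read": 0.02,
}


def estimate_cost(tokens: dict[str, int]) -> float:
    """Estimate the USD cost for token counts keyed by pricing category.

    Assumption: every category is billed independently at its listed rate.
    """
    unknown = set(tokens) - set(PRICES_PER_MILLION)
    if unknown:
        raise ValueError(f"unknown token categories: {sorted(unknown)}")
    return sum(PRICES_PER_MILLION[kind] * count / 1_000_000
               for kind, count in tokens.items())


# Hypothetical workload: 10M input tokens and 2M output tokens.
cost = estimate_cost({"input": 10_000_000, "output": 2_000_000})
print(f"${cost:.2f}")
```

For the hypothetical workload above, the input side contributes 10 × $0.20 and the output side 2 × $1.25.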

Specifications

Context window: 400K tokens
Max output: 128K tokens
API type: chat
Added: Mar 18, 2026
Model ID: openai-responses/gpt-5.4-nano
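
A client-side check against these limits might look like the sketch below. The helper `fits_limits` is hypothetical, and the sketch assumes that prompt and output tokens share the 400K context window; the listing does not say whether the 128K output cap counts against that window, so treat the second condition as an assumption.

```python
# Limits and model ID copied from the specifications above.
MODEL_ID = "openai-responses/gpt-5.4-nano"
CONTEXT_WINDOW = 400_000  # tokens
MAX_OUTPUT = 128_000      # tokens


def fits_limits(prompt_tokens: int, requested_output: int) -> bool:
    """Return True if a request stays within the listed limits.

    Assumption: prompt and output tokens share the context window.
    """
    return (requested_output <= MAX_OUTPUT
            and prompt_tokens + requested_output <= CONTEXT_WINDOW)


# Hypothetical request: a 300K-token prompt asking for up to 100K output tokens.
print(MODEL_ID, fits_limits(300_000, 100_000))
```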

Privacy & data

Data retention: Yes (30 days)
Used for training: No
Provider location: 🇺🇸 US