OpenAI Responses
gpt-5.4-nano
GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency use cases such as classification, data extraction, ranking, and sub-agent execution. The model prioritizes responsiveness and efficiency over deep reasoning, making it ideal for pipelines that require fast, reliable outputs at scale. GPT-5.4 nano is well suited for background tasks, real-time systems, and distributed agent architectures where minimizing cost and latency is essential.
πVisionπ§ Reasoningπ§Tool callingβ‘Caching
Pricing per 1M tokens
Input
$0.20
Output
$1.25
Cache write
$1.25
Cache read
$0.02
Specifications
Context window400K tokens
Max output128K tokens
API typechat
AddedMar 18, 2026
Model IDopenai-responses/gpt-5.4-nano
Privacy & data
Data retentionYes (30 days)
Used for trainingNo
Provider locationπΊπΈ US
Privacy policyOpenAI Privacy Policy β
