DeepInfra Inc.

Qwen/QwQ-32B

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique ability to switch seamlessly between a thinking mode for complex reasoning and a non-thinking mode for efficient dialogue ensures versatile, high-quality performance. Significantly outperforming prior models like QwQ and Qwen2.5, Qwen3 delivers superior mathematics, coding, commonsense reasoning, creative writing, and interactive dialogue capabilities. The Qwen3-30B-A3B variant includes 30.5 billion parameters (3.3 billion activated), 48 layers, 128 experts (8 activated per task), and supports up to 131K token contexts with YaRN, setting a new standard among open-source models.

Pricing

$0.12
Input tokens per million
$0.18
Output tokens per million

Technical Specifications

Context Window
128K tokens
Max Output Tokens
Unlimited
Global Availability
Last Updated
N/A

Provider

DeepInfra Inc.
Location
πŸ‡ΊπŸ‡Έ US
Visit Website β†’

Privacy & Data

Data Retention
No
Used for Training
No
DeepInfra Privacy Policy β†’
Qwen/QwQ-32B - AI Model Details | Requesty | Requesty