Parasail

AI infrastructure for developers and enterprises.

📍 🇺🇸 US•7 models available•Visit Website →
7
Available Models
$0.93
Avg Input Price/M
$0.15
Cheapest Model
parasail/parasail-qwen3-235b-a22b-instruct-2507
$3.00
Most Expensive
parasail/parasail-deepseek-r1

Features Overview

1
Vision Support
0
Advanced Reasoning
0
Caching Support
0
Computer Use

Privacy & Data Policy

Data Retention

No data retention

Location

🇺🇸 US

All Parasail Models

View All Providers →
Context Window
131K tokens
Max Output
8K tokens
Input
$0.59/M tokens
Output
$2.10/M tokens
Context Window
131K tokens
Max Output
16K tokens
Input
$0.99/M tokens
Output
$2.99/M tokens
Vision
Context Window
33K tokens
Max Output
8K tokens
Input
$0.7/M tokens
Output
$0.7/M tokens
Context Window
128K tokens
Max Output
8K tokens
Input
$0.3/M tokens
Output
$0.5/M tokens

Gemma 3 1B is the smallest of the new Gemma 3 family. It handles context windows up to 32k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling. Note: Gemma 3 1B is not multimodal. For the smallest multimodal Gemma 3 model, please see [Gemma 3 4B](google/gemma-3-4b-it)

Context Window
262K tokens
Max Output
8K tokens
Input
$0.15/M tokens
Output
$0.85/M tokens
Context Window
64K tokens
Max Output
8K tokens
Input
$3.00/M tokens
Output
$3.00/M tokens

DeepSeek-R1-Distill-Qwen-7B is a 7 billion parameter dense language model distilled from DeepSeek-R1, leveraging reinforcement learning-enhanced reasoning data generated by DeepSeek's larger models. The distillation process transfers advanced reasoning, math, and code capabilities into a smaller, more efficient model architecture based on Qwen2.5-Math-7B. This model demonstrates strong performance across mathematical benchmarks (92.8% pass@1 on MATH-500), coding tasks (Codeforces rating 1189), and general reasoning (49.1% pass@1 on GPQA Diamond), achieving competitive accuracy relative to larger models while maintaining smaller inference costs.

Context Window
164K tokens
Max Output
Unlimited
Input
$0.79/M tokens
Output
$1.15/M tokens

Ready to use Parasail models?

Access all Parasail models through Requesty's unified API with intelligent routing, caching, and cost optimization.

Parasail AI Models - Pricing & Features | Requesty