Provider Overview
Type
inference
Billing
Per token
Egress
Free
SLA Uptime
99.5%
Autoscaling
Yes
Cold Start
None
Model Pricing (3)
| Model | Input $/M | Output $/M | Latency | Throughput | Context |
|---|---|---|---|---|---|
| deepseek-v2.5Cheapest | $0.14 | $0.28 | 0.3s | 50 t/s | 32k |
| deepseek-v3 | $0.27 | $1.10 | 0.3s | 60 t/s | 64k |
| deepseek-r1 | $0.55 | $2.19 | 3s | 20 t/s | 64k |
Reputation Details
Pricing
70
Reliability
70
Features
75
Highlights
- Good pricing
- Autoscaling supported
- Fast cold start
Compare with Others
| Provider | Overall | Pricing | Reliability | Features | Models |
|---|---|---|---|---|---|
| DeepSeek | 72 | 70 | 70 | 75 | 3 |
| Together AI | 78 | 70 | 90 | 75 | 20 |
| Fireworks AI | 78 | 70 | 90 | 75 | 14 |
| Groq | 86 | 90 | 90 | 75 | 10 |
| DeepInfra | 86 | 90 | 90 | 75 | 21 |