Skip to content
Novita AI

Novita AI

Inference API Provider

Reputation:
72/100
novita.ai

Provider Overview

Type

inference

Billing

Per token

Egress

Free

SLA Uptime

99.5%

Autoscaling

Yes

Cold Start

None

Model Pricing (9)

ModelInput $/MOutput $/MLatencyThroughputContext
llama-3.1-8bCheapest$0.08$0.080.2s180 t/s128k
qwen-2.5-7b$0.10$0.100.18s170 t/s32k
mixtral-8x7b$0.20$0.200.25s110 t/s33k
deepseek-v3$0.35$0.350.4s60 t/s64k
llama-3.1-70b$0.49$0.490.4s70 t/s128k
llama-3.3-70b$0.49$0.490.35s75 t/s128k
qwen-2.5-72b$0.49$0.490.4s70 t/s32k
llama-3.1-405b$2.00$2.000.8s30 t/s128k
deepseek-r1$2.00$5.002.5s25 t/s64k

Reputation Details

Pricing
70
Reliability
70
Features
75

Highlights

  • Good pricing
  • Autoscaling supported
  • Fast cold start

Compare with Others

ProviderOverallPricingReliabilityFeaturesModels
Novita AI727070759
Together AI7870907520
Fireworks AI7870907514
Groq8690907510
DeepInfra8690907521