Skip to content

🏆 AI Model Performance Leaderboard

Compare 319AI models by quality, cost & value

319 Models·60 GPUs·19 Providers
🥇#1
Most Popular
Alibaba

Qwen 2.5 7B

Qwen 2.5 · 7.6B · 128K ctx

Quality
70
Cost $/M
$0.200
Value
350.0
Calculate ROI
🥈#2
Alibaba

Qwen 3 8B

Qwen 3 · 8.2B · 128K ctx

Quality
70
Cost $/M
$0.200
Value
350.0
Calculate ROI
🥉#3
Alibaba

Qwen 2.5 1.5B

Qwen 2.5 · 1.5B · 32K ctx

Quality
Cost $/M~
$0.027
Value
1862.0
Calculate ROI
#4
Alibaba

Qwen 2.5 3B

Qwen 2.5 · 3.1B · 32K ctx

Quality
58
Cost $/M
$0.100
Value
580.0
Calculate ROI
#5
Meta

Llama 3.1 8B

Llama 3.1 · 8B · 128K ctx

Quality
Cost $/M
$0.180
Value
322.2
Calculate ROI
#6
Alibaba

Qwen 3 4B

Qwen 3 · 4B · 128K ctx

Quality
57
Cost $/M
$0.100
Value
570.0
Calculate ROI
#7
Pareto Q×C×S
Meta

Llama 3.2 3B

Llama 3.2 · 3.2B · 128K ctx

Quality
55
Cost $/M
$0.060
Value
916.7
Calculate ROI
#8
Alibaba

Qwen 3 32B

Qwen 3 · 32.8B · 128K ctx

Quality
74
Cost $/M
$0.800
Value
92.5
Calculate ROI
#9
Pareto Q×C×S
Meta

Llama 3.2 1B

Llama 3.2 · 1.2B · 128K ctx

Quality
38
Cost $/M
$0.030
Value
1266.7
Calculate ROI
#10
Meta

Llama 3 8B

Llama 3 · 8B · 8K ctx

Quality
63
Cost $/M
$0.200
Value
315.0
Calculate ROI
#11
Meta

HelpSteer2 Llama 3.1 70B

Llama 3.1 · 70.6B · 128K ctx

Quality
82
Cost $/M
$0.500
Value
164.0
Calculate ROI
#12
Meta

Llama 3.1 70B

Llama 3.1 · 70.6B · 128K ctx

Quality
75
Cost $/M
$0.880
Value
85.2
Calculate ROI
#13
Meta

Llama 3.1 70B Turbo

Llama 3.1 · 70.6B · 128K ctx

Quality
Cost $/M
$0.880
Value
56.8
Calculate ROI
#14
Mistral

NV EmbedQA Mistral 7B

NV EmbedQA · 7.2B · 32K ctx

Quality
Cost $/M
$0.012
Value
4166.7
Calculate ROI
#15
Mistral

E5 Mistral 7B

E5 · 7.1B · 32K ctx

Quality
Cost $/M
$0.016
Value
3125.0
Calculate ROI
#16
Google

Gemma 3 1B

Gemma 3 · 1B · 32K ctx

Quality
35
Cost $/M~
$0.018
Value
1955.1
Calculate ROI
#17
Mistral

BioMistral 7B

BioMistral · 7.2B · 32K ctx

Quality
Cost $/M~
$0.129
Value
387.9
Calculate ROI
#18
Mistral

Mistral 7B

Mistral · 7.3B · 32K ctx

Quality
56
Cost $/M
$0.200
Value
280.0
Calculate ROI
#19
Meta

TinyLlama 1.1B Chat

TinyLlama · 1.1B · 2K ctx

Quality
Cost $/M~
$0.021
Value
2412.1
Calculate ROI
#20
Meta

TinyLlama 1.1B

TinyLlama · 1.1B · 2K ctx

Quality
Cost $/M~
$0.021
Value
2412.1
Calculate ROI
#21
Pareto Q×C×S
Alibaba

Qwen 2.5 14B

Qwen 2.5 · 14.8B · 128K ctx

Quality
76
Cost $/M
$0.400
Value
190.0
Calculate ROI
#22
Alibaba

Qwen 2.5 72B

Qwen 2.5 · 72.7B · 128K ctx

Quality
77
Cost $/M
$1.20
Value
64.2
Calculate ROI
#23
Microsoft

Phi 2

Phi · 2.7B · 2K ctx

Quality
Cost $/M~
$0.054
Value
931.0
Calculate ROI
#24
Alibaba

Qwen 2.5 32B

Qwen 2.5 · 32.5B · 128K ctx

Quality
73
Cost $/M
$0.800
Value
91.3
Calculate ROI
#25
DeepSeek

DeepSeek R1 Distill 1.5B

DeepSeek R1 · 1.5B · 128K ctx

Quality
42
Cost $/M~
$0.027
Value
1564.1
Calculate ROI
#26
DeepSeek

DeepSeek R1 Distill 8B

DeepSeek R1 · 8B · 128K ctx

Quality
Cost $/M
$0.200
Value
440.0
Calculate ROI
#27
DeepSeek

DeepSeek R1 Distill 14B

DeepSeek R1 · 14.8B · 128K ctx

Quality
Cost $/M
$0.300
Value
293.3
Calculate ROI
#28
DeepSeek

DeepSeek R1 Distill 32B

DeepSeek R1 · 32.8B · 128K ctx

Quality
Cost $/M
$0.600
Value
146.7
Calculate ROI
#29
DeepSeek

DeepSeek R1 Distill 70B

DeepSeek R1 · 70.6B · 128K ctx

Quality
Cost $/M
$0.880
Value
100.0
Calculate ROI
#30
Pareto Q×C×S
DeepSeek

DeepSeek R1

DeepSeek R1 · 671B MoE (37B active) · 128K ctx

Quality
88
Cost $/M
$2.19
Value
40.2
Calculate ROI
#31
DeepSeek

DeepSeek V3-0324

DeepSeek V3 · 685B MoE (37B active) · 128K ctx

Quality
Cost $/M
$0.420
Value
192.9
Calculate ROI
#32
Pareto Q×C×S
DeepSeek

DeepSeek V3

DeepSeek V3 · 671B MoE (37B active) · 128K ctx

Quality
81
Cost $/M
$0.420
Value
192.9
Calculate ROI
#33
Meta

Llama 3 70B

Llama 3 · 70.6B · 8K ctx

Quality
80
Cost $/M
$0.880
Value
90.9
Calculate ROI
#34
Meta

Llama 3 70B 1M Context

Llama 3 · 70.6B · 1024K ctx

Quality
Cost $/M
$1.50
Value
33.3
Calculate ROI
#35
Mistral

Mixtral 8x7B Instruct

Mixtral · 46.7B MoE (12.9B active) · 32K ctx

Quality
69
Cost $/M
$0.240
Value
287.5
Calculate ROI
#36
Mistral

Mixtral 8x7B

Mixtral · 46.7B MoE (12.9B active) · 32K ctx

Quality
67
Cost $/M
$0.600
Value
111.7
Calculate ROI
#37
Google

Gemma 2 9B

Gemma 2 · 9.2B · 8K ctx

Quality
68
Cost $/M
$0.200
Value
340.0
Calculate ROI
#38
Pareto Q×C×S
Microsoft

Phi-4

Phi · 14.7B · 16K ctx

Quality
73
Cost $/M
$0.140
Value
521.4
Calculate ROI
#39
Microsoft

Phi 4 Mini

Phi · 3.8B · 128K ctx

Quality
70
Cost $/M
$0.350
Value
200.0
Calculate ROI
#40
Meta

Llama 2 7B

Llama 2 · 7B · 4K ctx

Quality
40
Cost $/M~
$0.125
Value
319.2
Calculate ROI
#41
Meta

Llama 2 13B

Llama 2 · 13B · 4K ctx

Quality
47
Cost $/M~
$0.233
Value
202.0
Calculate ROI
#42
Meta

Llama 2 70B

Llama 2 · 70B · 4K ctx

Quality
62
Cost $/M
$0.900
Value
68.9
Calculate ROI
#43
Meta

Llama Guard 3 1B

Llama Guard · 1B · 128K ctx

Quality
Cost $/M~
$0.019
Value
2653.3
Calculate ROI
#44
Meta

Llama 3.3 8B

Llama 3.3 · 8B · 128K ctx

Quality
Cost $/M
$0.180
Value
277.8
Calculate ROI
#45
Meta

Llama Guard 3 8B

Llama Guard · 8B · 128K ctx

Quality
Cost $/M
$0.200
Value
250.0
Calculate ROI
#46
Best Context
Meta

Llama 4 Scout

Llama 4 · 109B MoE (17B active) · 10240K ctx

Quality
73
Cost $/M
$0.300
Value
243.3
Calculate ROI
#47
Meta

Code Llama 13B

Code Llama · 13B · 16K ctx

Quality
44
Cost $/M
$0.220
Value
200.0
Calculate ROI
#48
Meta

Code Llama 7B

Code Llama · 7B · 16K ctx

Quality
39
Cost $/M
$0.200
Value
195.0
Calculate ROI
#49
Pareto Q×C×S
Meta

Llama 3.1 Nemotron 51B

Llama 3.1 · 51B · 128K ctx

Quality
78
Cost $/M
$0.400
Value
195.0
Calculate ROI
#50
Meta

Llama 3.1 Nemotron 70B Reward

Llama 3.1 · 70.6B · 128K ctx

Quality
80
Cost $/M
$0.500
Value
160.0
Calculate ROI

Showing 319 of 319 models

Tracking 319 AI models across 60 GPUs and 19 providers, updated daily. The top-ranked model for overall quality is BGE Small EN v1.5 with a quality score of , available from $0.00/million output tokens. Rankings use InferenceBench's composite scoring combining benchmark results (MMLU, HumanEval, GSM8K), inference cost, and throughput efficiency.