What is the difference between RTX 4090 and RTX 5090?

The RTX 4090 has 24GB GDDR6X with 165 BF16 TFLOPS, while the RTX 5090 has 32GB GDDR6X with 210 BF16 TFLOPS. The RTX 4090 has 1008 GB/s memory bandwidth vs 1792 GB/s for the RTX 5090.

Which GPU is cheaper, RTX 4090 or RTX 5090?

The cheapest on-demand rate for the RTX 4090 is $0.39/hr, while the RTX 5090 starts at $0.89/hr. The RTX 4090 is +128% cheaper.

How many AI models fit on the RTX 4090 vs the RTX 5090?

At FP16 precision on a single GPU, the RTX 4090 can run 124 models from our catalog, while the RTX 5090 can run 148 models. The RTX 5090 supports 24 more models due to its 32GB VRAM.

RTX 4090 vs RTX 5090

Side-by-side comparison of the NVIDIA RTX 4090 and the NVIDIA RTX 5090 for AI inference workloads.

Specifications

Spec	RTX 4090	RTX 5090
Generation	ada	blackwell
Memory Type	GDDR6X	GDDR6X
VRAM	24 GB	32 GB
Memory Bandwidth	1,008 GB/s	1,792 GB/s
BF16 TFLOPS	165	210
FP16 TFLOPS	165	210
FP8 TFLOPS	330	420
INT8 TOPS	330	420
TDP	450 W	575 W
Interconnect	pcie	pcie
Max GPUs per Node	4	4
PCIe Gen	Gen 4	Gen 5
CUDA Compute Capability	8.9	8.9

Pricing

RTX 4090

Provider	On-Demand	Reserved	Spot
runpod	$1.10/hr	-	$0.79/hr
lambda	$0.89/hr	-	-
vast ai	$0.74/hr	-	$0.44/hr
tensordock	$0.69/hr	-	$0.44/hr
fluidstack	$0.59/hr	-	$0.39/hr

RTX 5090

Provider	On-Demand	Reserved	Spot
runpod	$1.69/hr	-	$1.29/hr
vast ai	$1.29/hr	-	$0.89/hr
tensordock	$1.19/hr	-	$0.89/hr

Cheapest available rate: RTX 4090 at $0.39/hr vs RTX 5090 at $0.89/hr — RTX 4090 is +128% cheaper

Efficiency Metrics

TFLOPS / Watt

0.4

RTX 4090

0.4

RTX 5090

BF16

VRAM / Dollar

61.5

RTX 4090

36.0

RTX 5090

GB/$/hr

Bandwidth / Watt

2.2

RTX 4090

3.1

RTX 5090

GB/s/W

Models (FP16, 1 GPU)

124.0

RTX 4090

148.0

RTX 5090

Model Compatibility (FP16, Single GPU)

Only on RTX 4090 (0)

None

Both (124)

Yi 1.5 9B
Yi Coder 9B
GTE Qwen2 7B
Marco O1
Qwen 2 Audio 7B
Qwen 2.5 3B
OLMo 2 7B
OpenELM 3B
BGE Large EN v1.5
BGE M3
StarCoder2 3B
StarCoder2 7B
Aya 23 8B
Command R 7B
DeepSeek Coder 6.7B
DeepSeek Math 7B
DeepSeek R1 Distill 8B
Falcon 7B
Gemma 1.1 2B
Gemma 2 2B
+104 more

Only on RTX 5090 (24)

Qwen 1.5 MoE A2.7B
Qwen 2.5 14B
OLMo 2 13B
Amazon Nova Lite
Baichuan 2 13B
DeepSeek R1 Distill 14B
Gemma 3 12B
Vicuna 13B
Code Llama 13B
Llama 2 13B
Orca 2 13B
Phi 3 Medium 14B
Phi-4
Mistral Nemo 12B
Pixtral 12B
VILA 1.5 13B
Qwen 2.5 Coder 14B
RWKV-6 14B
StableLM 2 12B
ELYZA 13B
+4 more

Summary

The RTX 4090 (ada generation) offers 24GB of GDDR6X with 165 BF16 TFLOPS and 1,008 GB/s memory bandwidth at 450W TDP.

The RTX 5090 (blackwell generation) offers 32GB of GDDR6X with 210 BF16 TFLOPS and 1,792 GB/s memory bandwidth at 575W TDP.

The RTX 5090 has +33% more VRAM, allowing it to run larger models without multi-GPU setups.

From a cost perspective, the RTX 4090 is more affordable at $0.39/hr vs $0.89/hr for the RTX 5090.

More GPU Comparisons

H100 SXM vs A100 80GB SXM H200 SXM vs H100 SXM H100 SXM vs H100 PCIe A100 80GB SXM vs A100 40GB SXM RTX 4090 vs L40S H100 SXM vs B200 SXM A100 80GB SXM vs L40S RTX 3090 vs RTX 4090 H100 NVL vs H100 SXM B200 SXM vs H200 SXM B200 SXM vs B100 SXM H200 SXM vs A100 80GB SXM