RTX 4090 vs L40S
Side-by-side comparison of the NVIDIA RTX 4090 and the NVIDIA L40S for AI inference workloads.
Specifications
Pricing
RTX 4090
| Provider | On-Demand | Reserved | Spot |
|---|---|---|---|
| runpod | $1.10/hr | - | $0.79/hr |
| lambda | $0.89/hr | - | - |
| vast ai | $0.74/hr | - | $0.44/hr |
| tensordock | $0.69/hr | - | $0.44/hr |
| fluidstack | $0.59/hr | - | $0.39/hr |
L40S
| Provider | On-Demand | Reserved | Spot |
|---|---|---|---|
| runpod | $1.90/hr | - | $1.49/hr |
| lambda | $1.59/hr | $1.19/hr | - |
| coreweave | $1.84/hr | $1.34/hr | - |
| aws | $2.56/hr | $1.69/hr | - |
| gcp | $2.45/hr | $1.62/hr | - |
| vast ai | $1.29/hr | - | $0.95/hr |
| tensordock | $1.19/hr | - | $0.89/hr |
| fluidstack | $1.09/hr | - | $0.85/hr |
Cheapest available rate: RTX 4090 at $0.39/hr vs L40S at $0.85/hr — RTX 4090 is +118% cheaper
Efficiency Metrics
TFLOPS / Watt
0.4
RTX 4090
1.0
L40S
BF16
VRAM / Dollar
61.5
RTX 4090
56.5
L40S
GB/$/hr
Bandwidth / Watt
2.2
RTX 4090
2.5
L40S
GB/s/W
Models (FP16, 1 GPU)
124.0
RTX 4090
162.0
L40S
Model Compatibility (FP16, Single GPU)
Only on RTX 4090 (0)
None
Both (124)
- Yi 1.5 9B
- Yi Coder 9B
- GTE Qwen2 7B
- Marco O1
- Qwen 2 Audio 7B
- Qwen 2.5 3B
- OLMo 2 7B
- OpenELM 3B
- BGE Large EN v1.5
- BGE M3
- StarCoder2 3B
- StarCoder2 7B
- Aya 23 8B
- Command R 7B
- DeepSeek Coder 6.7B
- DeepSeek Math 7B
- DeepSeek R1 Distill 8B
- Falcon 7B
- Gemma 1.1 2B
- Gemma 2 2B
- +104 more
Only on L40S (38)
- Qwen 1.5 MoE A2.7B
- Qwen 2.5 14B
- OLMo 2 13B
- Amazon Nova Lite
- Baichuan 2 13B
- OctoCoder 15B
- StarCoder2 15B
- DeepSeek MoE 16B
- DeepSeek R1 Distill 14B
- Gemma 3 12B
- InternLM 2.5 20B
- Vicuna 13B
- Code Llama 13B
- Llama 2 13B
- Orca 2 13B
- Phi 3 Medium 14B
- Phi-4
- Codestral 22B
- Mistral Nemo 12B
- Pixtral 12B
- +18 more
Summary
The RTX 4090 (ada generation) offers 24GB of GDDR6X with 165 BF16 TFLOPS and 1,008 GB/s memory bandwidth at 450W TDP.
The L40S (ada generation) offers 48GB of GDDR6 with 362 BF16 TFLOPS and 864 GB/s memory bandwidth at 350W TDP.
The L40S has +100% more VRAM, allowing it to run larger models without multi-GPU setups.
From a cost perspective, the RTX 4090 is more affordable at $0.39/hr vs $0.85/hr for the L40S.