L40
NVIDIA · Ada · 48 GB GDDR6 · 300W TDP

| VRAM | BF16 TFLOPS | Bandwidth | From |
|---|---|---|---|
| 48 GB | 362 | 864 GB/s | $0.75/hr |
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 48 GB GDDR6 |
| Memory Bandwidth | 864 GB/s |
| BF16 TFLOPS | 362 |
| FP16 TFLOPS | 362 |
| FP8 TFLOPS | 733 |
| INT8 TOPS | 733 |
| TDP | 300W |
| Interconnect | PCIe Gen4 |
| Max per Node | 8 |
| CUDA Compute Capability | 8.9 |
| Tensor Cores | Yes |
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| tensordock | $0.99/hr | - | $0.75/hr | Cheapest |
| vast_ai | $1.09/hr | - | $0.79/hr | |
| coreweave | $1.58/hr | $1.14/hr | - | |
| runpod | $1.59/hr | - | $1.19/hr | |
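The pricing table can be compared programmatically. The following is a minimal sketch that hard-codes the prices listed above (a snapshot that will drift over time), finds the cheapest offer per pricing tier, and computes each provider's spot discount relative to on-demand. The `PRICES` dict and the helper names are illustrative and not part of any provider's API.

```python
from typing import Optional

# Hourly prices in USD, copied from the table above. None means "not offered".
PRICES: dict[str, dict[str, Optional[float]]] = {
    "tensordock": {"on_demand": 0.99, "reserved": None, "spot": 0.75},
    "vast_ai": {"on_demand": 1.09, "reserved": None, "spot": 0.79},
    "coreweave": {"on_demand": 1.58, "reserved": 1.14, "spot": None},
    "runpod": {"on_demand": 1.59, "reserved": None, "spot": 1.19},
}


def cheapest(tier: str) -> tuple[str, float]:
    """Return (provider, price) for the lowest listed price in a tier."""
    offers = {p: t[tier] for p, t in PRICES.items() if t[tier] is not None}
    provider = min(offers, key=offers.get)
    return provider, offers[provider]


def spot_discount(provider: str) -> Optional[float]:
    """Fraction saved by spot vs. on-demand, or None if no spot price is listed."""
    tiers = PRICES[provider]
    if tiers["spot"] is None:
        return None
    return 1.0 - tiers["spot"] / tiers["on_demand"]


for tier in ("on_demand", "reserved", "spot"):
    print(tier, cheapest(tier))
```

For the listed prices, spot instances come out roughly 24 to 28 percent below on-demand on the providers that offer both.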
Compatible Models (239)
Single GPU (179 models)
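| Model | Parameters | Precision |
|---|---|---|
| Falcon 40B | 40B | FP8 |
| VILA 1.5 40B | 40B | FP8 |
| Aya 23 35B | 35B | FP8 |
| Command R | 35B | FP8 |
| Command R (August 2024) | 35B | FP8 |
| Yi 1.5 34B | 34.4B | FP8 |
| Code Llama 34B | 34B | FP8 |
| DeepSeek Coder 33B | 33B | FP8 |
| Vicuna 33B | 33B | FP8 |
| WizardCoder 33B | 33B | FP8 |
| DeepSeek R1 Distill 32B | 32.8B | FP8 |
| Qwen 3 32B | 32.8B | FP8 |
| Qwen 2.5 32B | 32.5B | FP8 |
| Qwen 2.5 Coder 32B | 32.5B | FP8 |
| Qwen 3 30B-A3B | 30.5B | FP8 |
| JAIS 30B | 30B | FP8 |
| MPT 30B | 30B | FP8 |
| Gemma 2 27B | 27B | FP8 |
| Gemma 3 27B | 27B | FP8 |
| InternVL2 26B | 26B | FP8 |

+159 more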
Multi-GPU (60 models)
| Model | GPUs | Precision |
|---|---|---|
| Qwen 2.5 72B | 2 | FP8 |
| Qwen 2.5 Math 72B | 2 | FP8 |
| Qwen 2.5 VL 72B | 2 | FP8 |
| Dolphin 2.9 72B | 2 | FP8 |
| DeepSeek R1 Distill 70B | 2 | FP8 |
| Llama 3 70B 1M Context | 2 | FP8 |
| Llama 3 70B | 2 | FP8 |
| Llama 3.1 70B | 2 | FP8 |
| Llama 3.3 70B | 2 | FP8 |
| Hermes 3 70B | 2 | FP8 |
| HelpSteer2 Llama 3.1 70B | 2 | FP8 |
| Llama 3.1 Nemotron 70B Instruct | 2 | FP8 |
| Llama 3.1 Nemotron 70B Reward | 2 | FP8 |
| Nemotron 70B | 2 | FP8 |
| Llama 3.1 70B Turbo | 2 | FP8 |

+45 more
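The single-GPU versus multi-GPU split is consistent with a simple weight-size check: at FP8, weights take about one byte per parameter, so a 40B model needs roughly 40 GB and fits in 48 GB, while a 70B model does not. The sketch below is a back-of-envelope estimate, not the method this page uses (which is not stated). The 10% headroom for KV cache and activations is an assumed value, and `gpus_for_inference` is a hypothetical helper.

```python
import math

VRAM_GB = 48  # L40 memory, from the spec sheet


def gpus_for_inference(params_billions: float,
                       bytes_per_param: float = 1.0,  # FP8 = 1 byte per weight
                       headroom: float = 0.10) -> int:  # assumed KV-cache/activation margin
    """Estimate how many 48 GB GPUs are needed to hold the weights plus headroom."""
    weights_gb = params_billions * bytes_per_param
    needed_gb = weights_gb * (1.0 + headroom)
    return max(1, math.ceil(needed_gb / VRAM_GB))


for name, size in [("Falcon 40B", 40), ("Qwen 3 32B", 32.8), ("Llama 3.1 70B", 70)]:
    print(f"{name}: {gpus_for_inference(size)} GPU(s) at FP8")
```

Passing `bytes_per_param=2.0` gives the corresponding BF16/FP16 estimate. Long contexts or large batch sizes need more headroom than assumed here.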
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 3 GPUs | 1 GPU |
| 13B model | 6 GPUs | 1 GPU |
| 70B model | 28 GPUs | 1 GPU |
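The full fine-tune column is consistent with a common rule of thumb: mixed-precision AdamW keeps about 16 bytes per parameter (BF16 weights and gradients, FP32 master weights, and two FP32 optimizer moments), plus some overhead for activations. The sketch below applies that rule with a 20% overhead. The overhead value is an assumption chosen because it reproduces the table; the page does not state its formula. The QLoRA helper only estimates the 4-bit base weights and ignores adapters and activations.

```python
VRAM_GB = 48  # L40 memory, from the spec sheet

# 2 (BF16 weights) + 2 (BF16 grads) + 4 (FP32 master copy) + 4 + 4 (Adam moments)
BYTES_PER_PARAM = 16
OVERHEAD_PCT = 20  # assumed margin for activations and buffers


def full_finetune_gpus(params_billions: int) -> int:
    """Estimated GPU count for full fine-tuning, using integer math to avoid rounding drift."""
    needed_centi_gb = params_billions * BYTES_PER_PARAM * (100 + OVERHEAD_PCT)
    capacity_centi_gb = VRAM_GB * 100
    return -(-needed_centi_gb // capacity_centi_gb)  # ceiling division


def qlora_weights_gb(params_billions: float) -> float:
    """Approximate size of a 4-bit quantized base model (0.5 bytes per parameter)."""
    return params_billions * 0.5


for size in (7, 13, 70):
    print(f"{size}B: {full_finetune_gpus(size)} GPUs full fine-tune, "
          f"{qlora_weights_gb(size):.1f} GB of 4-bit weights for QLoRA")
```

Under these assumptions the estimate gives 3, 6, and 28 GPUs for 7B, 13B, and 70B, and a 70B model's 4-bit weights (about 35 GB) fit within a single 48 GB card.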
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | t/s/W | Precision |
|---|---|---|
| Mistral 7B | 0.39 | FP8 |
| Qwen 2.5 7B | 0.38 | FP8 |
| Llama 3.1 8B | 0.36 | FP8 |
| Llama 3.1 70B | 0.04 | FP8 |
| Qwen 2.5 72B | 0.04 | FP8 |
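A tokens-per-second-per-watt figure converts to throughput and cost once a power draw and an hourly price are assumed. The sketch below assumes a single GPU drawing its full 300 W TDP and the $0.75/hr starting price; both are assumptions, since real power draw varies with load and the page does not say whether the 70B-class figures are per GPU or per two-GPU setup.

```python
TDP_WATTS = 300        # from the spec sheet; assumes the GPU runs at full TDP
PRICE_PER_HOUR = 0.75  # lowest listed price in USD; an assumed input


def tokens_per_second(tokens_per_sec_per_watt: float,
                      watts: float = TDP_WATTS) -> float:
    """Convert an efficiency figure (t/s/W) into throughput (t/s)."""
    return tokens_per_sec_per_watt * watts


def usd_per_million_tokens(tps: float,
                           price_per_hour: float = PRICE_PER_HOUR) -> float:
    """Cost of generating one million tokens at a steady throughput."""
    tokens_per_hour = tps * 3600
    return price_per_hour / tokens_per_hour * 1_000_000


tps = tokens_per_second(0.39)  # Mistral 7B efficiency from the list above
print(f"{tps:.0f} t/s, ${usd_per_million_tokens(tps):.2f} per million tokens")
```

For Mistral 7B this works out to roughly 117 tokens/s and about $1.78 per million tokens under the stated assumptions.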
Similar GPUs
| GPU | VRAM | BF16 TFLOPS | BW (GB/s) | From |
|---|---|---|---|---|
| L40S | 48 GB | 362 | 864 | $0.85/hr |
| RTX 6000 Ada | 48 GB | 91.1 | 960 | $0.59/hr |
| L20 | 48 GB | 239 | 864 | $0.80/hr |
| L4 | 24 GB | 121 | 300 | $0.29/hr |
| RTX 4090 | 24 GB | 165 | 1008 | $0.39/hr |