GH200
NVIDIA · Hopper · 96 GB HBM3 · 900 W TDP
VRAM
96 GB
BF16 TFLOPS
990
Bandwidth
4000 GB/s
From
$2.99/hr
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 96 GB HBM3 |
| Memory Bandwidth | 4000 GB/s |
| BF16 TFLOPS | 990 |
| FP16 TFLOPS | 990 |
| FP8 TFLOPS | 1980 |
| INT8 TOPS | 1980 |
| TDP | 900 W |
| Interconnect | NVLink |
| NVLink Bandwidth | 900 GB/s |
| Max per Node | 8 |
| PCIe | Gen5 |
| CUDA Compute Capability | 9.0 |
| Tensor Cores | Yes |
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| CoreWeave | $3.99/hr | $2.99/hr | - | Cheapest |
| Lambda | $3.49/hr | - | - | - |
Compatible Models (249)
Single GPU (207 models)
- Qwen 2.5 72B (72.7B, FP8)
- Qwen 2.5 Math 72B (72.7B, FP8)
- Qwen 2.5 VL 72B (72.7B, FP8)
- Dolphin 2.9 72B (72B, FP8)
- DeepSeek R1 Distill 70B (70.6B, FP8)
- Llama 3 70B 1M Context (70.6B, FP8)
- Llama 3 70B (70.6B, FP8)
- Llama 3.1 70B (70.6B, FP8)
- Llama 3.3 70B (70.6B, FP8)
- Hermes 3 70B (70.6B, FP8)
- HelpSteer2 Llama 3.1 70B (70.6B, FP8)
- Llama 3.1 Nemotron 70B Instruct (70.6B, FP8)
- Llama 3.1 Nemotron 70B Reward (70.6B, FP8)
- Nemotron 70B (70.6B, FP8)
- Llama 3.1 70B Turbo (70.6B, FP8)
- Code Llama 70B (70B, FP8)
- Llama 2 70B (70B, FP8)
- WizardMath 70B (70B, FP8)
- Meditron 70B (70B, FP8)
- Japanese StableLM 70B (70B, FP8)
- +187 more
Multi-GPU (42 models)
- Mixtral 8x22B (2 GPUs, FP8)
- DBRX Base (2 GPUs, FP8)
- DBRX Instruct (2 GPUs, FP8)
- Mistral Large 2411 (2 GPUs, FP8)
- Mistral Large 2 (2 GPUs, FP8)
- Llama 4 Scout (2 GPUs, FP8)
- Command R+ (2 GPUs, FP8)
- Yi-Large (2 GPUs, FP8)
- YaLM 100B (2 GPUs, FP8)
- Llama 3.2 90B Vision (2 GPUs, FP8)
- Llama 3.2 90B Vision Instruct (2 GPUs, FP8)
- Claude Sonnet 4 (2 GPUs, BF16)
- o1-mini (2 GPUs, BF16)
- o3-mini (2 GPUs, BF16)
- Claude 3 Sonnet (2 GPUs, BF16)
- +27 more
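The single-GPU vs. multi-GPU split above follows directly from weight size at the serving precision versus the 96 GB of HBM3. A minimal sketch of that fit check, assuming roughly 1 byte/param at FP8 (2 at BF16/FP16) plus an assumed ~20% headroom for KV cache and activations (the headroom factor is my assumption, not the page's actual sizing rule):

```python
import math

# Bytes of weight storage per parameter at each precision.
BYTES_PER_PARAM = {"FP8": 1.0, "BF16": 2.0, "FP16": 2.0}

def gpus_needed(params_b: float, precision: str = "FP8",
                vram_gb: float = 96.0, overhead: float = 1.2) -> int:
    """Estimate GPUs needed to hold the weights, with ~20% headroom
    for KV cache and activations (assumed, not measured)."""
    weights_gb = params_b * BYTES_PER_PARAM[precision]
    return max(1, math.ceil(weights_gb * overhead / vram_gb))

print(gpus_needed(72.7, "FP8"))  # Qwen 2.5 72B: fits on one GH200
print(gpus_needed(141, "FP8"))   # Mixtral 8x22B: needs two
```

With these assumptions, a 72.7B model at FP8 (~87 GB with headroom) lands on a single GH200, while Mixtral 8x22B (~141B params) spills onto a second, matching the lists above.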
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 2 GPUs | 1 GPU |
| 13B model | 3 GPUs | 1 GPU |
| 70B model | 14 GPUs | 1 GPU |
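The table's counts are consistent with a rule-of-thumb memory budget for AdamW in BF16: 2 bytes/param weights + 2 bytes/param gradients + 12 bytes/param optimizer state (fp32 master copy plus two fp32 moments), with a couple more bytes for activations and buffers. A sketch that reproduces the table under that assumption (the 18 B/param and 0.75 B/param figures are assumptions chosen to match, not published constants):

```python
import math

VRAM_GB = 96.0  # GH200 memory, from the spec sheet

def full_finetune_gpus(params_b: float, bytes_per_param: float = 18.0) -> int:
    """AdamW + BF16: 2 B weights + 2 B grads + 12 B optimizer state
    (fp32 master + two moments) + ~2 B activations/buffers (assumed)."""
    return math.ceil(params_b * bytes_per_param / VRAM_GB)

def qlora_gpus(params_b: float, bytes_per_param: float = 0.75) -> int:
    """4-bit base weights (~0.5 B/param) plus adapter weights and
    optimizer state for the small LoRA matrices (rough assumption)."""
    return math.ceil(params_b * bytes_per_param / VRAM_GB)

print(full_finetune_gpus(70))  # 70B full fine-tune
print(qlora_gpus(70))          # 70B QLoRA
```

Note that full fine-tuning is dominated by optimizer state, which is why QLoRA, which freezes the quantized base model and trains only low-rank adapters, keeps even a 70B model on one GPU.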
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | Efficiency | Precision |
|---|---|---|
| Mistral 7B | 0.61 t/s/W | FP8 |
| Qwen 2.5 7B | 0.58 t/s/W | FP8 |
| Llama 3.1 8B | 0.55 t/s/W | FP8 |
| Llama 3.1 70B | 0.06 t/s/W | FP8 |
| Qwen 2.5 72B | 0.06 t/s/W | FP8 |
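Multiplying these per-watt figures by the 900 W TDP gives a rough throughput ceiling per GPU. A sketch, assuming the card actually draws its full TDP during decode (real draw is usually lower, so treat the result as an upper bound):

```python
TDP_W = 900  # GH200 TDP from the spec sheet

# tokens/second per watt, FP8, from the table above
EFFICIENCY = {
    "Mistral 7B": 0.61,
    "Qwen 2.5 7B": 0.58,
    "Llama 3.1 8B": 0.55,
    "Llama 3.1 70B": 0.06,
    "Qwen 2.5 72B": 0.06,
}

def implied_throughput(model: str) -> float:
    """Tokens/s if the GPU ran at full TDP; a ceiling, not a benchmark."""
    return EFFICIENCY[model] * TDP_W

print(implied_throughput("Mistral 7B"))
print(implied_throughput("Llama 3.1 70B"))
```

The ~10x efficiency gap between the 7B-8B and 70B-class models tracks the ~10x parameter gap: at single-batch decode, both are memory-bandwidth-bound, so tokens per joule scale roughly inversely with weight bytes read per token.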