H100 NVL (94 GB per GPU, sold as a pair)
NVIDIA · Hopper · 188 GB HBM3 total · 800 W TDP
- VRAM: 188 GB
- BF16 TFLOPS: 1670
- Bandwidth: 7876 GB/s
- Price: from $5.49/hr
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 188 GB HBM3 |
| Memory Bandwidth | 7876 GB/s |
| BF16 TFLOPS | 1670 |
| FP16 TFLOPS | 1670 |
| FP8 TFLOPS | 3341 |
| INT8 TOPS | 3341 |
| TDP | 800 W |
| Interconnect | NVLink |
| NVLink Bandwidth | 600 GB/s |
| Max per Node | 4 |
| PCIe | Gen 5 |
| CUDA Compute Capability | 9.0 |
| Tensor Cores | Yes |
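The bandwidth figure above gives a quick roofline for inference: autoregressive decoding is typically memory-bandwidth-bound, so each generated token must stream roughly the full set of model weights from HBM. A minimal sketch (an estimate under that assumption, ignoring KV-cache traffic and batching):

```python
# Roofline estimate of single-stream decode throughput: assumes decoding is
# memory-bandwidth-bound and each token streams all weights from HBM once.
BANDWIDTH_GBS = 7876  # H100 NVL pair, from the spec sheet above

def max_decode_tokens_per_s(params_b: float, bytes_per_param: float) -> float:
    """Upper bound on tokens/s; ignores KV cache and kernel overheads."""
    model_gb = params_b * bytes_per_param
    return BANDWIDTH_GBS / model_gb

# Llama 3.1 70B in FP8 (1 byte/param): ~112 tokens/s ceiling
print(round(max_decode_tokens_per_s(70.6, 1.0)))  # → 112
```

Real throughput lands below this ceiling; batching raises aggregate tokens/s well past it because weights are streamed once per step, not once per sequence.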
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| coreweave | $7.49/hr | $5.49/hr | - | Cheapest |
| runpod | $7.89/hr | - | - | - |
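To compare these rates over a billing cycle, the arithmetic is simple (a sketch assuming a 730-hour average month; rates taken from the coreweave row above):

```python
HOURS_PER_MONTH = 730  # average month (8760 h / 12)

on_demand, reserved = 7.49, 5.49  # coreweave $/hr from the table above
monthly_od = on_demand * HOURS_PER_MONTH
monthly_res = reserved * HOURS_PER_MONTH
savings_pct = (1 - reserved / on_demand) * 100

print(f"on-demand ${monthly_od:,.2f}/mo, reserved ${monthly_res:,.2f}/mo "
      f"({savings_pct:.0f}% cheaper)")
```

Reserved pricing here works out to roughly 27% below on-demand, or about $1,460 saved per GPU-pair per month of continuous use.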
Compatible Models (249)
Single GPU (226 models)
Mixtral 8x22B (141B, FP8), DBRX Base (132B, FP8), DBRX Instruct (132B, FP8), Mistral Large 2411 (123B, FP8), Mistral Large 2 (123B, FP8), Llama 4 Scout (109B, FP8), Command R+ (104B, FP8), Yi-Large (102.6B, FP8), YaLM 100B (100B, FP8), Llama 3.2 90B Vision (90B, FP8), Llama 3.2 90B Vision Instruct (88.8B, FP8), Qwen 2.5 72B (72.7B, FP8), Qwen 2.5 Math 72B (72.7B, FP8), Qwen 2.5 VL 72B (72.7B, FP8), Dolphin 2.9 72B (72B, FP8), DeepSeek R1 Distill 70B (70.6B, FP8), Llama 3 70B 1M Context (70.6B, FP8), Llama 3 70B (70.6B, FP8), Llama 3.1 70B (70.6B, FP8), Llama 3.3 70B (70.6B, FP8), +206 more
Multi-GPU (23 models)
Grok-2 (×2, FP8), DeepSeek Coder V2 236B (×2, FP8), DeepSeek V2.5 (×2, FP8), Qwen 3 235B (×2, FP8), Falcon 180B (×2, FP8), Command A (×2, BF16), Inflection 3 (×2, BF16), DeepSeek R1 (×3, INT4), DeepSeek V3 (×3, INT4), Llama 3.1 405B (×3, FP8), Llama 4 Maverick (×3, FP8), Jamba 1.5 Large (×3, FP8), Snowflake Arctic 128x3B (×3, FP8), Nemotron 340B (×3, FP8), Claude Opus 4 (×3, BF16), +8 more
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 1 GPU | 1 GPU |
| 13B model | 2 GPUs | 1 GPU |
| 70B model | 8 GPUs | 1 GPU |
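The table's counts can be approximated with a standard bytes-per-parameter rule of thumb. A minimal sketch, assuming ~12 bytes/param for a full AdamW BF16 fine-tune (2 weights + 2 gradients + 8 FP32 optimizer states) and ~1 byte/param for QLoRA (4-bit base weights plus adapter and optimizer overhead); the exact accounting behind the table is not stated, and this rule gives 9 rather than 8 GPUs for the 70B full fine-tune:

```python
from math import ceil

GPU_VRAM_GB = 94  # one H100 NVL GPU

# Assumed rules of thumb, not vendor figures:
#   full: 2 B (BF16 weights) + 2 B (grads) + 8 B (FP32 AdamW states) per param
#   qlora: ~0.5 B (4-bit weights) + ~0.5 B adapter/optimizer/activation overhead
BYTES_PER_PARAM = {"full": 12.0, "qlora": 1.0}

def gpus_needed(params_b: float, mode: str) -> int:
    """Estimate GPUs needed to hold training state (activations ignored)."""
    mem_gb = params_b * BYTES_PER_PARAM[mode]  # params in billions -> GB
    return ceil(mem_gb / GPU_VRAM_GB)

for size in (7, 13, 70):
    print(f"{size}B: full={gpus_needed(size, 'full')}, "
          f"qlora={gpus_needed(size, 'qlora')}")
```

Activation memory, sequence length, and sharding strategy (ZeRO stage, tensor parallelism) shift the real requirement, which is why published counts vary around such estimates.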
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | Tokens/s per Watt | Precision |
|---|---|---|
| Mistral 7B | 1.35 | FP8 |
| Qwen 2.5 7B | 1.30 | FP8 |
| Llama 3.1 8B | 1.23 | FP8 |
| Llama 3.1 70B | 0.14 | FP8 |
| Qwen 2.5 72B | 0.14 | FP8 |
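Tokens/s per watt is the same thing as tokens per joule, so multiplying by 3.6 million (joules per kWh) converts these figures into tokens per kWh, and from there into an electricity cost per million tokens. A sketch using an assumed $0.10/kWh electricity price (not from the source):

```python
def tokens_per_kwh(tps_per_watt: float) -> float:
    # tokens/s per W == tokens per joule; 1 kWh = 3.6e6 J
    return tps_per_watt * 3_600_000

PRICE_PER_KWH = 0.10  # assumed electricity price, not from the source

def energy_cost_per_m_tokens(tps_per_watt: float) -> float:
    return 1_000_000 / tokens_per_kwh(tps_per_watt) * PRICE_PER_KWH

# Mistral 7B at 1.35 t/s/W -> 4.86M tokens per kWh
print(f"Mistral 7B: {tokens_per_kwh(1.35):,.0f} tokens/kWh, "
      f"${energy_cost_per_m_tokens(1.35):.3f}/M tokens")
```

By this measure the ~10× efficiency gap between the 7B-class and 70B-class rows translates directly into a ~10× gap in energy cost per token.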