V100 32GB
NVIDIA · Volta · 32 GB HBM2 · 300W TDP
- VRAM: 32 GB
- BF16 TFLOPS: 28.3
- Bandwidth: 900 GB/s
- From: $0.19/hr
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 32 GB HBM2 |
| Memory Bandwidth | 900 GB/s |
| BF16 TFLOPS | 28.3 |
| FP16 TFLOPS | 28.3 |
| FP8 TFLOPS | 28.3 |
| INT8 TOPS | 56.5 |
| TDP | 300W |
| Interconnect | NVLink |
| NVLink Bandwidth | 300 GB/s |
| Max per Node | 8 |
| PCIe | Gen3 |
| CUDA Compute Capability | 7.0 |
| Tensor Cores | Yes |

Note: Volta has no native BF16 or FP8 datapaths; those rows reflect the FP16 rate.
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| Vast.ai | $0.35/hr | - | $0.19/hr | Cheapest |
| TensorDock | $0.29/hr | - | $0.19/hr | |
| RunPod | $0.49/hr | - | $0.29/hr | |
| AWS | $3.06/hr | $1.96/hr | $0.92/hr | |
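The gap between on-demand and spot rates compounds quickly at sustained usage. A minimal sketch, assuming a 730-hour month of continuous use (the helper and the 730-hour figure are illustrative assumptions, not from the table):

```python
# Rough monthly cost from an hourly rate, assuming 730 hours/month
# (24 h x ~30.4 days) at full utilization. Illustrative only; spot
# instances can be preempted, so real uptime is rarely 100%.
HOURS_PER_MONTH = 730

def monthly_cost(hourly_rate: float) -> float:
    """Hourly $/hr rate -> estimated $/month at full utilization."""
    return hourly_rate * HOURS_PER_MONTH

# e.g. AWS on-demand ($3.06/hr) vs the cheapest spot rate ($0.19/hr)
aws_on_demand = monthly_cost(3.06)
cheapest_spot = monthly_cost(0.19)
```

At those rates the on-demand bill is roughly sixteen times the spot bill for the same month of compute.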
Compatible Models (235)
Single GPU (160 models)
Gemma 2 27B (27B, FP8) · Gemma 3 27B (27B, FP8) · InternVL2 26B (26B, FP8) · Mistral Small 24B (24B, FP8) · Mistral Small 3.1 24B (24B, FP8) · Codestral 22B (22B, FP8) · Solar Pro 22B (22B, FP8) · GigaChat 20B (20B, FP8) · InternLM 20B (20B, FP8) · InternLM 2.5 20B (19.9B, FP8) · CogVLM2 19B (19B, FP8) · DeepSeek MoE 16B (16.4B, FP8) · CodeGen2 16B (16B, FP8) · DeepSeek V2 Lite (15.7B, FP8) · OctoCoder 15B (15.5B, FP8) · StarCoder2 15B (15.5B, FP8) · Nemotron 15B (15B, FP8) · Qwen 2.5 14B (14.8B, FP8) · DeepSeek R1 Distill 14B (14.8B, FP8) · Phi-4 (14.7B, FP8) · +140 more
Multi-GPU (75 models)
Jamba 1.5 Mini (x2, FP8) · Llama 3.1 Nemotron 51B (x2, FP8) · Amazon Nova Pro (x2, FP8) · Mixtral 8x7B (x2, FP8) · Mixtral 8x7B Instruct (x2, FP8) · Phi 3.5 MoE (x2, FP8) · Falcon 40B (x2, FP8) · VILA 1.5 40B (x2, FP8) · Aya 23 35B (x2, FP8) · Command R (x2, FP8) · Command R (August 2024) (x2, FP8) · Yi 1.5 34B (x2, FP8) · Code Llama 34B (x2, FP8) · DeepSeek Coder 33B (x2, FP8) · Vicuna 33B (x2, FP8) · +60 more
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 5 GPUs | 1 GPU |
| 13B model | 8 GPUs | 1 GPU |
| 70B model | 42 GPUs | 2 GPUs |
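The counts above can be reproduced with a simple VRAM budget. A minimal sketch, assuming ~16 bytes/parameter for full AdamW fine-tuning in BF16 (BF16 weights + gradients, FP32 master weights and optimizer moments) with a 20% activation/overhead margin, and ~0.7 bytes/parameter for QLoRA (4-bit weights plus adapters); both constants are illustrative assumptions, not measured values:

```python
import math

VRAM_GB = 32  # V100 32GB

def gpus_needed(params_b: float) -> tuple[int, int]:
    """Return (full fine-tune GPUs, QLoRA GPUs) for a model of
    `params_b` billion parameters, by dividing the estimated memory
    footprint across 32 GB cards and rounding up."""
    full_gb = params_b * 16 * 1.2   # optimizer states dominate; +20% overhead
    qlora_gb = params_b * 0.7       # 4-bit base weights + LoRA adapters
    return math.ceil(full_gb / VRAM_GB), math.ceil(qlora_gb / VRAM_GB)
```

With these constants, `gpus_needed(7)`, `gpus_needed(13)`, and `gpus_needed(70)` match the table's 5/1, 8/1, and 42/2 rows.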
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | Efficiency | Precision |
|---|---|---|
| Mistral 7B | 0.41 t/s/W | FP8 |
| Qwen 2.5 7B | 0.39 t/s/W | FP8 |
| Llama 3.1 8B | 0.37 t/s/W | FP8 |
| Llama 3.1 70B | 0.04 t/s/W | FP8 |
| Qwen 2.5 72B | 0.04 t/s/W | FP8 |
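Multiplying these per-Watt figures by the card's 300W TDP gives an estimated absolute throughput. A rough upper bound, since real power draw usually sits below TDP:

```python
# Estimated decoding throughput at full board power, derived from the
# tokens/second-per-Watt figures above and the V100's 300 W TDP.
TDP_W = 300

EFFICIENCY_T_PER_S_PER_W = {
    "Mistral 7B": 0.41,
    "Qwen 2.5 7B": 0.39,
    "Llama 3.1 8B": 0.37,
    "Llama 3.1 70B": 0.04,
    "Qwen 2.5 72B": 0.04,
}

def throughput_tok_s(model: str) -> float:
    """Tokens/second ~= efficiency (t/s/W) x power draw (W)."""
    return EFFICIENCY_T_PER_S_PER_W[model] * TDP_W
```

For example, Mistral 7B works out to about 123 tokens/s at full TDP, while the 70B-class models land near 12 tokens/s.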
Similar GPUs
| GPU | VRAM | BF16 TFLOPS | BW (GB/s) | From |
|---|---|---|---|---|
| V100 16GB | 16 GB | 28.3 | 900 | $0.15/hr |
| RTX 5090 | 32 GB | 210 | 1792 | $0.89/hr |
| Instinct MI100 | 32 GB | 184.6 | 1229 | $0.40/hr |
| TPU v4 | 32 GB | 275 | 1200 | $2.25/hr |
| TPU v6e (Trillium) | 32 GB | 460 | 1640 | $1.75/hr |