A4000
NVIDIA · Ampere · 16 GB GDDR6 · 140W TDP
VRAM 16 GB · BF16 76 TFLOPS · Bandwidth 448 GB/s · From $0.17/hr
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 16 GB GDDR6 |
| Memory Bandwidth | 448 GB/s |
| BF16 TFLOPS | 76 |
| FP16 TFLOPS | 76 |
| FP8 TFLOPS | 76 |
| INT8 TOPS | 76 |
| TDP | 140W |
| Interconnect | PCIe |
| Max per Node | 8 |
| PCIe Gen | 4 |
| CUDA Compute Capability | 8.6 |
| Tensor Cores | Yes |
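Reading the roofline off these numbers: 76 BF16 TFLOPS against 448 GB/s puts the ridge point near 170 FLOPs per byte, so single-stream LLM decode (roughly 2 FLOPs per weight byte at FP8) is firmly bandwidth-bound on this card. A minimal sketch of the arithmetic, using only the spec-sheet constants above:

```python
# Roofline sketch for the A4000 from the spec sheet above.
# Ridge point = peak compute / peak bandwidth; kernels with lower
# arithmetic intensity are memory-bound, higher ones compute-bound.

PEAK_BF16_TFLOPS = 76   # dense tensor-core throughput
PEAK_BW_GBS = 448       # GDDR6 memory bandwidth

ridge = (PEAK_BF16_TFLOPS * 1e12) / (PEAK_BW_GBS * 1e9)
print(f"ridge point: {ridge:.0f} FLOPs/byte")   # ~170

# Single-token decode of a dense LLM does ~2 FLOPs per FP8 weight byte,
# far below the ridge, so inference throughput here is set by the
# 448 GB/s bus, not the 76 TFLOPS.
decode_intensity = 2.0
print(f"decode intensity ~{decode_intensity:.0f} FLOPs/byte -> memory-bound")
```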
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Notes |
|---|---|---|---|---|
| TensorDock | $0.25/hr | - | $0.17/hr | Cheapest |
| Vast.ai | $0.30/hr | - | $0.18/hr | - |
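One way to compare these rates is dollars per million generated tokens. The sketch below assumes a hypothetical single-stream throughput of ~64 tok/s (bandwidth-bound FP8 decode of a 7B model on one A4000); real serving with batching would come out far cheaper per token:

```python
# Rough $/1M-token comparison from the pricing table above.
# TOKENS_PER_SEC is a hypothetical single-stream estimate:
# 448 GB/s of bandwidth streaming ~7 GB of FP8 weights per token.

TOKENS_PER_SEC = 448 / 7.0   # ~64 tok/s

def usd_per_million_tokens(usd_per_hour: float) -> float:
    tokens_per_hour = TOKENS_PER_SEC * 3600
    return usd_per_hour / tokens_per_hour * 1e6

for provider, rate in [("TensorDock spot", 0.17), ("TensorDock on-demand", 0.25),
                       ("Vast.ai spot", 0.18), ("Vast.ai on-demand", 0.30)]:
    print(f"{provider:>22}: ${usd_per_million_tokens(rate):.2f}/1M tokens")
```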
Compatible Models (222)
Single GPU (134 models)
OLMo 2 13B (13B, FP8) · Baichuan 2 13B (13B, FP8) · Vicuna 13B (13B, FP8) · Code Llama 13B (13B, FP8) · Llama 2 13B (13B, FP8) · Orca 2 13B (13B, FP8) · VILA 1.5 13B (13B, FP8) · ELYZA 13B (13B, FP8) · Cerebras GPT 13B (13B, FP8) · KULLM 12.8B (12.8B, FP8) · StableLM 2 12B (12.1B, FP8) · Amazon Nova Lite (12B, FP8) · Gemma 3 12B (12B, FP8) · Mistral Nemo 12B (12B, FP8) · Pixtral 12B (12B, FP8) · FLUX.1 Dev (12B, FP8) · Llama 3.2 11B Vision (11B, FP8) · Falcon 11B (11B, FP8) · SOLAR 10.7B (10.7B, FP8) · GLM-4 9B (9.4B, FP8) · +114 more
Multi-GPU (88 models)
Gemma 2 27B (×2, FP8) · Gemma 3 27B (×2, FP8) · InternVL2 26B (×2, FP8) · Mistral Small 24B (×2, FP8) · Mistral Small 3.1 24B (×2, FP8) · Codestral 22B (×2, FP8) · Solar Pro 22B (×2, FP8) · GigaChat 20B (×2, FP8) · InternLM 20B (×2, FP8) · InternLM 2.5 20B (×2, FP8) · CogVLM2 19B (×2, FP8) · DeepSeek MoE 16B (×2, FP8) · CodeGen2 16B (×2, FP8) · DeepSeek V2 Lite (×2, FP8) · OctoCoder 15B (×2, FP8) · +73 more
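The single/multi-GPU split above follows from a simple capacity check: FP8 weights take roughly 1 GB per billion parameters, and some VRAM has to stay free for KV cache and activations. A sketch assuming a hypothetical 10% reserve; it reproduces the cutoffs in the lists above:

```python
import math

VRAM_GB = 16
RESERVE = 0.90   # assume ~10% of VRAM kept for KV cache/activations (hypothetical)

def gpus_needed_fp8(params_billions: float) -> int:
    """FP8-quantized weights ~= 1 byte/param, so ~1 GB per billion params."""
    weight_gb = params_billions * 1.0
    return math.ceil(weight_gb / (VRAM_GB * RESERVE))

for name, b in [("OLMo 2 13B", 13), ("Gemma 3 12B", 12),
                ("Gemma 2 27B", 27), ("Mistral Small 24B", 24)]:
    n = gpus_needed_fp8(b)
    print(f"{name}: {n} GPU{'s' if n > 1 else ''}")
# -> 13B and 12B fit on a single A4000; 27B and 24B need x2
```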
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 9 GPUs | 1 GPU |
| 13B model | 16 GPUs | 1 GPU |
| 70B model | 83 GPUs | 3 GPUs |
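These counts are consistent with the usual mixed-precision AdamW rule of thumb (about 16 bytes per parameter for weights, gradients, and optimizer states) plus activation overhead, and with 4-bit base weights for QLoRA. A sketch with a hypothetical 20% activation margin; it lands within one GPU of the table:

```python
import math

VRAM_GB = 16

def gpus_full_finetune(params_b: float) -> int:
    """AdamW mixed precision: BF16 weights + grads (4 B/param) plus FP32
    master weights and two optimizer moments (12 B/param) = 16 B/param,
    with a hypothetical +20% for activations and fragmentation."""
    mem_gb = params_b * 16 * 1.20
    return math.ceil(mem_gb / VRAM_GB)

def gpus_qlora(params_b: float) -> int:
    """QLoRA: 4-bit base weights (~0.5 B/param); adapters and their
    optimizer states are comparatively tiny. Assume ~10% VRAM reserved."""
    mem_gb = params_b * 0.5
    return math.ceil(mem_gb / (VRAM_GB * 0.90))

for b in (7, 13, 70):
    print(f"{b}B: full={gpus_full_finetune(b)}, qlora={gpus_qlora(b)}")
# -> 7B: full=9 qlora=1; 13B: full=16 qlora=1; 70B: full=84 qlora=3
# (the table lists 83 GPUs for the 70B full fine-tune; its overhead
#  assumptions evidently differ slightly from the 20% used here)
```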
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | Efficiency | Precision |
|---|---|---|
| Mistral 7B | 0.44 t/s/W | FP8 |
| Qwen 2.5 7B | 0.42 t/s/W | FP8 |
| Llama 3.1 8B | 0.40 t/s/W | FP8 |
| Llama 3.1 70B | 0.05 t/s/W | FP8 |
| Qwen 2.5 72B | 0.04 t/s/W | FP8 |
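These figures match a simple bandwidth-bound decode estimate: tokens per second is roughly memory bandwidth divided by the FP8 weight footprint, and dividing by the 140W TDP gives t/s/W. A sketch using published parameter counts (assumed here, not taken from the listing):

```python
# Reproduce the efficiency figures above from first principles:
# single-stream FP8 decode is memory-bound, so
#   tok/s ~= bandwidth / weight_bytes,   t/s/W = tok/s / TDP.
# FP8 weight footprint ~= 1 GB per billion parameters.

BW_GBS, TDP_W = 448, 140

def tps_per_watt(params_b: float) -> float:
    toks_per_sec = BW_GBS / params_b
    return toks_per_sec / TDP_W

for name, b in [("Mistral 7B", 7.25), ("Qwen 2.5 7B", 7.62),
                ("Llama 3.1 8B", 8.03), ("Llama 3.1 70B", 70.6),
                ("Qwen 2.5 72B", 72.7)]:
    print(f"{name}: {tps_per_watt(b):.2f} t/s/W")
# -> 0.44, 0.42, 0.40, 0.05, 0.04 (matching the table above)
```

For the 70B-class models the same ratio holds per GPU under tensor parallelism, since aggregate bandwidth and aggregate power scale together with the GPU count.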