Updated minutes ago· Sources: GPU Pricing, API Token Pricing, Model Registry

A100 40GB SXM

nvidia · ampere · 40 GB HBM2e · 400W TDP

VRAM

40 GB

BF16 TFLOPS

312

Bandwidth

1555 GB/s

From

$0.85/hr

Calculate ROI with this GPU →

Spec Sheet

VRAM40 GB HBM2e

Memory Bandwidth1555 GB/s

BF16 TFLOPS312

FP16 TFLOPS312

FP8 TFLOPS312

INT8 TOPS624

TDP400W

InterconnectNVLINK

NVLink Bandwidth600 GB/s

Max per Node8

PCIe Gen4

CUDA Compute Capability8

Tensor CoresYes

Pricing by Provider

Provider	On-Demand	Reserved	Spot	Badge
tensordock	$1.19/hr	-	$0.85/hr	Cheapest
vast_ai	$1.30/hr	-	$0.89/hr
runpod	$1.64/hr	-	$1.19/hr
lambda	$1.29/hr	-	-
aws	$3.06/hr	$1.96/hr	-
gcp	$2.93/hr	$1.98/hr	-

Training Capabilities

Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA

Model Size	Full Fine-Tune	QLoRA
7B model	4 GPUs	1 GPU
13B model	7 GPUs	1 GPU
70B model	33 GPUs	2 GPUs

Train on this GPU →

Energy Efficiency

Estimated tokens/second per Watt for popular models

Mistral 7B

0.53 t/s/WFP8

Qwen 2.5 7B

0.51 t/s/WFP8

Llama 3.1 8B

0.48 t/s/WFP8

Llama 3.1 70B

0.06 t/s/WFP8

Qwen 2.5 72B

0.05 t/s/WFP8

Similar GPUs

GPU	VRAM	BF16 TFLOPS	BW (GB/s)	From
A100 40GB PCIe	40 GB	312	1555	$0.69/hr
RTX A6000	48 GB	38.7	768	$0.49/hr
A40	48 GB	37.4	696	$0.42/hr
A10G	24 GB	35	600	$0.30/hr
A30	24 GB	165	933	$0.35/hr

A100 40GB SXM

Spec Sheet

Pricing by Provider

Compatible Models (238)

Single GPU (171 models)

Multi-GPU (67 models)

Training Capabilities

Energy Efficiency

Similar GPUs