Skip to content
Updated minutes ago
nvidia

A16

nvidia · ampere · 64 GB GDDR6 · 250W TDP

VRAM

64 GB

BF16 TFLOPS

16.8

Bandwidth

232 GB/s

From

$0.72/hr

Calculate ROI with this GPU →

Spec Sheet

VRAM64 GB GDDR6
Memory Bandwidth232 GB/s
BF16 TFLOPS16.8
FP16 TFLOPS16.8
FP8 TFLOPS16.8
INT8 TOPS33.6
TDP250W
InterconnectPCIE
Max per Node8
PCIe Gen4
CUDA Compute Capability8.6
Tensor CoresYes

Pricing by Provider

ProviderOn-DemandReservedSpotBadge
aws$1.10/hr$0.72/hr-Cheapest

Compatible Models (248)

Training Capabilities

Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA

Model SizeFull Fine-TuneQLoRA
7B model3 GPUs1 GPU
13B model4 GPUs1 GPU
70B model21 GPUs1 GPU

Energy Efficiency

Estimated tokens/second per Watt for popular models

Mistral 7B
0.13 t/s/WFP8
Qwen 2.5 7B
0.12 t/s/WFP8
Llama 3.1 8B
0.12 t/s/WFP8
Llama 3.1 70B
0.01 t/s/WFP8
Qwen 2.5 72B
0.01 t/s/WFP8

Similar GPUs

GPUVRAMBF16 TFLOPSBW (GB/s)From
A100 80GB SXM80 GB3122039$1.19/hr
A100 80GB PCIe80 GB3122039$1.05/hr
RTX A600048 GB38.7768$0.49/hr
A4048 GB37.4696$0.42/hr
A100 40GB SXM40 GB3121555$0.85/hr