Skip to content
Updated minutes ago
nvidia

A2

nvidia · ampere · 16 GB GDDR6 · 60W TDP

VRAM

16 GB

BF16 TFLOPS

18

Bandwidth

200 GB/s

From

$0.15/hr

Calculate ROI with this GPU →

Spec Sheet

VRAM16 GB GDDR6
Memory Bandwidth200 GB/s
BF16 TFLOPS18
FP16 TFLOPS18
FP8 TFLOPS36
INT8 TOPS36
TDP60W
InterconnectPCIE
Max per Node1
PCIe Gen4
CUDA Compute Capability8.6
Tensor CoresYes

Pricing by Provider

ProviderOn-DemandReservedSpotBadge
retail$0.15/hr--Cheapest

Compatible Models (159)

Training Capabilities

Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA

Model SizeFull Fine-TuneQLoRA
7B model9 GPUs1 GPU
13B model16 GPUs1 GPU
70B model83 GPUs3 GPUs

Energy Efficiency

Estimated tokens/second per Watt for popular models

Mistral 7B
0.46 t/s/WFP8
Qwen 2.5 7B
0.44 t/s/WFP8
Llama 3.1 8B
0.42 t/s/WFP8

Similar GPUs

GPUVRAMBF16 TFLOPSBW (GB/s)From
A400016 GB76448$0.17/hr
RTX 306012 GB12.7360$0.06/hr
RTX 308010 GB47760$0.14/hr
A10G24 GB35600$0.30/hr
A3024 GB165933$0.35/hr