RTX 3080
NVIDIA · Ampere · 10 GB GDDR6X · 320W TDP
VRAM: 10 GB · BF16: 47 TFLOPS · Bandwidth: 760 GB/s · From: $0.14/hr
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 10 GB GDDR6X |
| Memory Bandwidth | 760 GB/s |
| BF16 TFLOPS | 47 |
| FP16 TFLOPS | 47 |
| FP8 TFLOPS | 47 |
| INT8 TOPS | 47 |
| TDP | 320W |
| Interconnect | PCIe |
| Max per Node | 4 |
| PCIe Gen | 4 |
| CUDA Compute Capability | 8.6 |
| Tensor Cores | Yes |
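On a rented instance, these values can be sanity-checked from PyTorch; a minimal sketch, assuming `torch` is installed and the GPU is visible:

```python
# Query the device and compare against the spec sheet above.
import torch

if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    major, minor = torch.cuda.get_device_capability(0)
    print(f"Device:             {props.name}")        # expect an RTX 3080
    print(f"Compute capability: {major}.{minor}")     # expect 8.6
    print(f"Total VRAM:         {props.total_memory / 1024**3:.1f} GiB")
    print(f"SM count:           {props.multi_processor_count}")
else:
    print("No CUDA device visible")
```

`nvidia-smi` reports similar information from the command line.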
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| tensordock | $0.22/hr | - | $0.14/hr | Cheapest |
| vast_ai | $0.28/hr | - | $0.16/hr | - |
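For budgeting, the hourly rates translate directly into monthly figures; a quick sketch of the arithmetic, with rates hardcoded from the table above and 24/7 utilization assumed:

```python
# Rough monthly cost at the listed rates. Spot capacity is preemptible,
# so treat the spot figure as a floor rather than a guarantee.
HOURS_PER_MONTH = 730  # average hours in a month

rates = {  # $/hr, from the table above
    "tensordock": {"on_demand": 0.22, "spot": 0.14},
    "vast_ai":    {"on_demand": 0.28, "spot": 0.16},
}

for provider, r in rates.items():
    print(f"{provider:10s}  on-demand ${r['on_demand'] * HOURS_PER_MONTH:6.2f}/mo"
          f"   spot ${r['spot'] * HOURS_PER_MONTH:6.2f}/mo")
```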
Compatible Models (182)
Single GPU (110 models)
CodeGemma 7B (8.5B, FP8) · Qwen 3 8B (8.2B, FP8) · Llama 3.1 8B (8.03B, FP8) · Hermes 3 8B (8.03B, FP8) · Aya 23 8B (8B, FP8) · DeepSeek R1 Distill 8B (8B, FP8) · Llama 3 8B (8B, FP8) · Llama Guard 3 8B (8B, FP8) · Ministral 8B (8B, FP8) · Minitron 8B (8B, FP8) · Llama 3.3 8B (8B, FP8) · NV Embed v2 (7.85B, FP8) · InternLM 2.5 7B (7.74B, FP8) · GTE Qwen2 7B (7.6B, FP8) · Marco O1 (7.6B, FP8) · Qwen 2 Audio 7B (7.6B, FP8) · Qwen 2.5 7B (7.6B, FP8) · Qwen 2.5 Coder 7B (7.6B, FP8) · Qwen 2.5 Math 7B (7.6B, FP8) · Qwen 2.5 VL 7B (7.6B, FP8) · +90 more
Multi-GPU (72 models)
DeepSeek MoE 16B (x2, FP8) · CodeGen2 16B (x2, FP8) · DeepSeek V2 Lite (x2, FP8) · OctoCoder 15B (x2, FP8) · StarCoder2 15B (x2, FP8) · Nemotron 15B (x2, FP8) · Qwen 2.5 14B (x2, FP8) · DeepSeek R1 Distill 14B (x2, FP8) · Phi-4 (x2, FP8) · Qwen 2.5 Coder 14B (x2, FP8) · Qwen 1.5 MoE A2.7B (x2, FP8) · RWKV-6 14B (x2, FP8) · Phi 3 Medium 14B (x2, FP8) · Nekomata 14B (x2, FP8) · OLMo 2 13B (x2, FP8) · +57 more
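The split between the two lists is consistent with a simple VRAM-fit estimate: at FP8, weights take roughly one byte per parameter, plus headroom for the KV cache, activations, and runtime buffers. A back-of-envelope sketch; the 1.2x overhead factor and the exact parameter counts are assumptions, not the rule this page uses:

```python
# Estimate how many 10 GB cards a model needs at FP8.
import math

VRAM_GB = 10
OVERHEAD = 1.2  # KV cache, activations, buffers (assumed factor)

def gpus_needed(params_billions: float) -> int:
    weights_gb = params_billions * 1.0  # FP8: ~1 byte per parameter
    return math.ceil(weights_gb * OVERHEAD / VRAM_GB)

for name, size_b in [("Llama 3.1 8B", 8.03), ("Qwen 2.5 14B", 14.7), ("Phi-4", 14.7)]:
    print(f"{name:13s} -> {gpus_needed(size_b)}x RTX 3080")
```

This reproduces the pattern above: ~8B models fit on one card, while ~14-16B models need two.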
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 14 GPUs | 1 GPU |
| 13B model | 25 GPUs | 1 GPU |
| 70B model | 132 GPUs | 5 GPUs |
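These counts follow from standard memory arithmetic. Full fine-tuning with AdamW in BF16 needs roughly 16 bytes per parameter (2 for BF16 weights, 2 for gradients, 8 for the FP32 Adam moments, 4 for an FP32 master copy), while QLoRA freezes a 4-bit base model (~0.5 bytes per parameter) and trains only small adapters. A sketch of the estimate; the overhead factor is an assumed rule of thumb, so the 70B figure lands near, not exactly on, the table's value:

```python
# Back-of-envelope GPU counts for a 10 GB card.
import math

VRAM_GB = 10
OVERHEAD = 1.2  # activations, buffers, fragmentation (assumed factor)

def gpus(params_billions: float, bytes_per_param: float) -> int:
    return math.ceil(params_billions * bytes_per_param * OVERHEAD / VRAM_GB)

for size in (7, 13, 70):
    full = gpus(size, 16.0)  # BF16 weights + grads, FP32 Adam states + master copy
    qlora = gpus(size, 0.5)  # 4-bit frozen base; adapter states are negligible
    print(f"{size}B: full fine-tune ~{full} GPUs, QLoRA ~{qlora} GPU(s)")
```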
Energy Efficiency
Estimated tokens/second per watt for popular models
Mistral 7B: 0.33 t/s/W (FP8)
Qwen 2.5 7B: 0.31 t/s/W (FP8)
Llama 3.1 8B: 0.30 t/s/W (FP8)
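The metric is simply decode throughput divided by board power. A minimal sketch of the arithmetic; the throughput numbers below are back-solved from the figures above at the 320W TDP, not independent measurements, and a real benchmark should use measured draw (e.g. from `nvidia-smi`) rather than TDP:

```python
# tokens/s per watt = decode throughput / board power
TDP_W = 320

throughput_tps = {  # tokens/s, back-solved from the 0.33/0.31/0.30 figures above
    "Mistral 7B":   105.6,
    "Qwen 2.5 7B":   99.2,
    "Llama 3.1 8B":  96.0,
}

for model, tps in throughput_tps.items():
    print(f"{model:13s} {tps / TDP_W:.2f} t/s/W")
```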