RTX 4060 Ti 16GB
nvidia · ada · 16 GB GDDR6 · 165W TDP
- VRAM: 16 GB
- BF16 TFLOPS: 44
- Bandwidth: 288 GB/s
- From: $0.30/hr
Spec Sheet
- VRAM: 16 GB GDDR6
- Memory Bandwidth: 288 GB/s
- BF16 TFLOPS: 44
- FP16 TFLOPS: 44
- FP8 TFLOPS: 88
- INT8 TOPS: 88
- TDP: 165W
- Interconnect: PCIe
- Max per Node: 1
- PCIe Gen: 4
- CUDA Compute Capability: 8.9
- Tensor Cores: Yes
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| retail | $0.30/hr | - | - | Cheapest |
Compatible Models (159)
Single GPU (159 models)
| Model | Params | Quantization |
|---|---|---|
| Gemma 2 27B | 27B | INT4 |
| Gemma 3 27B | 27B | INT4 |
| InternVL2 26B | 26B | INT4 |
| Mistral Small 24B | 24B | INT4 |
| Mistral Small 3.1 24B | 24B | INT4 |
| Codestral 22B | 22B | INT4 |
| Solar Pro 22B | 22B | INT4 |
| GigaChat 20B | 20B | INT4 |
| InternLM 20B | 20B | INT4 |
| InternLM 2.5 20B | 19.9B | INT4 |
| CogVLM2 19B | 19B | INT4 |
| DeepSeek MoE 16B | 16.4B | INT4 |
| CodeGen2 16B | 16B | INT4 |
| DeepSeek V2 Lite | 15.7B | INT4 |
| OctoCoder 15B | 15.5B | INT4 |
| StarCoder2 15B | 15.5B | INT4 |
| Nemotron 15B | 15B | INT4 |
| Qwen 2.5 14B | 14.8B | INT4 |
| DeepSeek R1 Distill 14B | 14.8B | INT4 |
| Phi-4 | 14.7B | INT4 |

+139 more
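As a rough sanity check on why these models top out around 27B at INT4, 4-bit weights take about 0.5 bytes per parameter, and the remainder of the 16 GB is needed for KV cache, activations, and runtime overhead. This is an illustrative sketch with an assumed 2 GB overhead figure, not the exact fitting logic behind the list:

```python
def fits_int4(params_b: float, vram_gb: float = 16.0, overhead_gb: float = 2.0) -> bool:
    """Rough check: INT4 weights take ~0.5 bytes/param; reserve some
    VRAM for KV cache, activations, and framework overhead (assumed 2 GB)."""
    weight_gb = params_b * 0.5  # billions of params * 0.5 bytes/param = GB
    return weight_gb + overhead_gb <= vram_gb

# Gemma 2 27B: 13.5 GB of weights + 2 GB overhead = 15.5 GB -> fits in 16 GB
print(fits_int4(27))  # True
# A 32B model would need ~18 GB -> does not fit
print(fits_int4(32))  # False
```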
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 9 GPUs | 1 GPU |
| 13B model | 16 GPUs | 1 GPU |
| 70B model | 83 GPUs | 3 GPUs |
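The table above follows from standard memory rules of thumb. Full AdamW fine-tuning in mixed precision needs about 16 bytes per parameter (4 for FP32 master weights, 4 + 4 for the two Adam moments, 2 + 2 for BF16 weights and gradients), while QLoRA keeps the base model in 4 bits (~0.5 bytes/param) with comparatively tiny adapter and optimizer state. The sketch below uses an assumed 1.25× overhead factor for activations and buffers and assumes ideal sharding; it lands in the same ballpark as the table, but the site's exact overhead assumptions may differ:

```python
import math

def gpus_needed(params_b: float, bytes_per_param: float,
                vram_gb: float = 16.0, overhead: float = 1.25) -> int:
    """Estimate GPU count: parameter memory times an overhead factor for
    activations/buffers, split across GPUs (assumes ideal sharding)."""
    total_gb = params_b * bytes_per_param * overhead
    return math.ceil(total_gb / vram_gb)

FULL_BF16_ADAMW = 16.0  # 4 master + 4 + 4 Adam moments + 2 weights + 2 grads
QLORA = 0.5             # 4-bit base weights; adapters/optimizer are tiny

for size in (7, 13, 70):
    print(f"{size}B: full={gpus_needed(size, FULL_BF16_ADAMW)} "
          f"qlora={gpus_needed(size, QLORA)}")
```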
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | Efficiency | Precision |
|---|---|---|
| Mistral 7B | 0.24 t/s/W | FP8 |
| Qwen 2.5 7B | 0.23 t/s/W | FP8 |
| Llama 3.1 8B | 0.22 t/s/W | FP8 |
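Tokens/s per Watt is throughput divided by power draw, so multiplying by the card's 165W TDP recovers an absolute throughput estimate. A back-of-the-envelope conversion, assuming the GPU actually draws full TDP during inference:

```python
TDP_W = 165  # RTX 4060 Ti 16GB board power

def tokens_per_sec(efficiency_tsw: float, power_w: float = TDP_W) -> float:
    """Convert tokens/s/W back to absolute tokens/s (assumes full TDP draw)."""
    return efficiency_tsw * power_w

for model, eff in [("Mistral 7B", 0.24), ("Qwen 2.5 7B", 0.23), ("Llama 3.1 8B", 0.22)]:
    print(f"{model}: {tokens_per_sec(eff):.1f} tok/s")  # e.g. Mistral 7B: 39.6 tok/s
```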
Similar GPUs
| GPU | VRAM | BF16 TFLOPS | BW (GB/s) | From |
|---|---|---|---|---|
| RTX 4080 | 16 GB | 97 | 717 | $0.32/hr |
| RTX 4070 Ti | 12 GB | 93 | 504 | $0.25/hr |
| RTX 4070 Super | 12 GB | 55 | 504 | $0.22/hr |
| L4 | 24 GB | 121 | 300 | $0.29/hr |
| RTX 4090 | 24 GB | 165 | 1008 | $0.39/hr |