RTX 4070 Ti
NVIDIA · Ada Lovelace · 12 GB GDDR6X · 285W TDP
VRAM: 12 GB · BF16 TFLOPS: 93 · Bandwidth: 504 GB/s · From $0.25/hr
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 12 GB GDDR6X |
| Memory Bandwidth | 504 GB/s |
| BF16 TFLOPS | 93 |
| FP16 TFLOPS | 93 |
| FP8 TFLOPS | 186 |
| INT8 TOPS | 186 |
| TDP | 285W |
| Interconnect | PCIe |
| Max per Node | 4 |
| PCIe Generation | 4 |
| CUDA Compute Capability | 8.9 |
| Tensor Cores | Yes |
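One way to read the compute and bandwidth figures together is the roofline balance point: peak FLOPs divided by memory bandwidth gives the arithmetic intensity (FLOPs per byte) a kernel needs to exceed before it becomes compute-bound rather than bandwidth-bound. A minimal sketch using the spec-sheet numbers above (treating 93 TFLOPS as the usable BF16 peak is an assumption; batch-1 LLM decoding sits far below this balance point, which is why bandwidth dominates inference throughput):

```python
# Roofline balance point for the RTX 4070 Ti, from the spec sheet above
bf16_flops = 93e12   # 93 BF16 TFLOPS (assumed usable peak)
bandwidth = 504e9    # 504 GB/s memory bandwidth

# FLOPs per byte a kernel must sustain to be compute-bound on this card
balance = bf16_flops / bandwidth
print(f"balance point: ~{balance:.0f} FLOPs/byte")
```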
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| tensordock | $0.39/hr | - | $0.25/hr | Cheapest |
| vast_ai | $0.45/hr | - | $0.28/hr | - |
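The spot discount implied by the table is easy to quantify. A small sketch with the prices hard-coded from the rows above (listed prices drift, so treat the figures as a snapshot):

```python
# (on-demand, spot) prices in $/hr, copied from the pricing table above
offers = {
    "tensordock": (0.39, 0.25),
    "vast_ai": (0.45, 0.28),
}

for provider, (on_demand, spot) in offers.items():
    discount = 1 - spot / on_demand
    print(f"{provider}: spot saves {discount:.0%}")
```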
Compatible Models (205)
Single GPU (115 models)
GLM-4 9B (9.4B, FP8) · ChatGLM4 9B (9.4B, FP8) · Gemma 2 9B (9.2B, FP8) · Yi 1.5 9B (8.83B, FP8) · Yi Coder 9B (8.8B, FP8) · CodeGemma 7B (8.5B, FP8) · Qwen 3 8B (8.2B, FP8) · Llama 3.1 8B (8.03B, FP8) · Hermes 3 8B (8.03B, FP8) · Aya 23 8B (8B, FP8) · DeepSeek R1 Distill 8B (8B, FP8) · Llama 3 8B (8B, FP8) · Llama Guard 3 8B (8B, FP8) · Ministral 8B (8B, FP8) · Minitron 8B (8B, FP8) · Llama 3.3 8B (8B, FP8) · NV Embed v2 (7.85B, FP8) · InternLM 2.5 7B (7.74B, FP8) · GTE Qwen2 7B (7.6B, FP8) · Marco O1 (7.6B, FP8) · +95 more
Multi-GPU (90 models)
GigaChat 20B (×2, FP8) · InternLM 20B (×2, FP8) · InternLM 2.5 20B (×2, FP8) · CogVLM2 19B (×2, FP8) · DeepSeek MoE 16B (×2, FP8) · CodeGen2 16B (×2, FP8) · DeepSeek V2 Lite (×2, FP8) · OctoCoder 15B (×2, FP8) · StarCoder2 15B (×2, FP8) · Nemotron 15B (×2, FP8) · Qwen 2.5 14B (×2, FP8) · DeepSeek R1 Distill 14B (×2, FP8) · Phi-4 (×2, FP8) · Qwen 2.5 Coder 14B (×2, FP8) · Qwen 1.5 MoE A2.7B (×2, FP8) · +75 more
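The single- vs multi-GPU split above is consistent with a simple rule of thumb: FP8 weights take roughly 1 byte per parameter, and some VRAM must be held back for KV cache and activations. A sketch of that heuristic, assuming ~10% headroom (the exact overhead the listing uses is not stated, so the constant is an assumption):

```python
import math

VRAM_GB = 12     # RTX 4070 Ti
HEADROOM = 0.9   # assume ~10% of VRAM reserved for KV cache/activations

def gpus_for_inference(params_b: float, bytes_per_param: float = 1.0) -> int:
    """FP8 weights take ~1 byte/param; count GPUs needed to hold them."""
    weight_gb = params_b * bytes_per_param
    return math.ceil(weight_gb / (VRAM_GB * HEADROOM))

print(gpus_for_inference(9.4))  # GLM-4 9B, the largest single-GPU entry
print(gpus_for_inference(20))   # 20B-class models land in the x2 list
```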
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 11 GPUs | 1 GPU |
| 13B model | 21 GPUs | 1 GPU |
| 70B model | 110 GPUs | 4 GPUs |
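The table matches a standard memory rule of thumb: full AdamW fine-tuning in BF16 costs on the order of 16 bytes per parameter (weights + gradients + two optimizer moments plus overhead), while QLoRA holds the frozen base in 4-bit, well under 1 byte per parameter. A sketch that reproduces the table, assuming ~85% of the 12 GB is usable; both constants are assumptions, not figures published by the site:

```python
import math

VRAM_GB = 12
USABLE = 0.85   # assume ~85% of VRAM usable; rest for activations/overhead

def gpus_needed(params_b: float, bytes_per_param: float) -> int:
    """Memory-only estimate: params (billions) x bytes/param vs usable VRAM."""
    return math.ceil(params_b * bytes_per_param / (VRAM_GB * USABLE))

FULL_FT = 16    # BF16 weights + grads + two AdamW moments + overhead (assumed)
QLORA = 0.58    # 4-bit base weights + adapters + quantization state (assumed)

for size in (7, 13, 70):
    print(f"{size}B: full={gpus_needed(size, FULL_FT)}, "
          f"qlora={gpus_needed(size, QLORA)}")
```

Activation memory scales with batch size and sequence length, so a memory-only estimate like this is a floor, not a guarantee.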
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | Efficiency | Precision |
|---|---|---|
| Mistral 7B | 0.24 t/s/W | FP8 |
| Qwen 2.5 7B | 0.23 t/s/W | FP8 |
| Llama 3.1 8B | 0.22 t/s/W | FP8 |
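Multiplying these per-watt figures by the card's 285W TDP gives a rough absolute throughput, assuming the GPU actually draws full board power during decoding (real draw is often lower, so treat these as ballpark numbers):

```python
TDP_W = 285  # RTX 4070 Ti board power, from the spec sheet

# tokens/s per watt (FP8) from the efficiency figures above
efficiency = {
    "Mistral 7B": 0.24,
    "Qwen 2.5 7B": 0.23,
    "Llama 3.1 8B": 0.22,
}

for model, tsw in efficiency.items():
    print(f"{model}: ~{tsw * TDP_W:.0f} tokens/s at full TDP")
```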
Similar GPUs
| GPU | VRAM | BF16 TFLOPS | BW (GB/s) | From |
|---|---|---|---|---|
| RTX 4070 Super | 12 GB | 55 | 504 | $0.22/hr |
| RTX 4080 | 16 GB | 97 | 717 | $0.32/hr |
| RTX 4060 Ti 16GB | 16 GB | 44 | 288 | $0.30/hr |
| RTX 4060 | 8 GB | 30 | 272 | $0.22/hr |
| L4 | 24 GB | 121 | 300 | $0.29/hr |
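One way to use this comparison table is to rank the cards by BF16 TFLOPS per dollar-hour at the listed "From" price. A sketch with the rows hard-coded from the table above (these are the lowest listed prices and will drift, and raw TFLOPS-per-dollar ignores VRAM and bandwidth, which often matter more for LLM inference):

```python
# (name, BF16 TFLOPS, "From" price in $/hr) from the Similar GPUs table
gpus = [
    ("RTX 4070 Super", 55, 0.22),
    ("RTX 4080", 97, 0.32),
    ("RTX 4060 Ti 16GB", 44, 0.30),
    ("RTX 4060", 30, 0.22),
    ("L4", 121, 0.29),
]

ranked = sorted(gpus, key=lambda g: g[1] / g[2], reverse=True)
for name, tflops, price in ranked:
    print(f"{name}: {tflops / price:.0f} TFLOPS per $/hr")
```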