Skip to content
Updated minutes ago
google

TPU v4

google · tpu · 32 GB HBM · 175W TDP

VRAM

32 GB

BF16 TFLOPS

275

Bandwidth

1200 GB/s

From

$2.25/hr

Calculate ROI with this GPU →

Spec Sheet

VRAM32 GB HBM
Memory Bandwidth1200 GB/s
BF16 TFLOPS275
FP16 TFLOPS275
FP8 TFLOPS275
INT8 TOPS550
TDP175W
InterconnectPCIE
Max per Node4096
PCIe Gen4
Tensor CoresNo

Pricing by Provider

ProviderOn-DemandReservedSpotBadge
gcp$3.22/hr$2.25/hr-Cheapest

Compatible Models (253)

Training Capabilities

Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA

Model SizeFull Fine-TuneQLoRA
7B model5 GPUs1 GPU
13B model8 GPUs1 GPU
70B model42 GPUs2 GPUs

Energy Efficiency

Estimated tokens/second per Watt for popular models

Mistral 7B
0.94 t/s/WFP8
Qwen 2.5 7B
0.90 t/s/WFP8
Llama 3.1 8B
0.85 t/s/WFP8
DeepSeek V3
0.19 t/s/WFP8
Llama 3.1 70B
0.10 t/s/WFP8
Qwen 2.5 72B
0.09 t/s/WFP8

Similar GPUs

GPUVRAMBF16 TFLOPSBW (GB/s)From
TPU v6e (Trillium)32 GB4601640$1.75/hr
TPU v5e16 GB200820$0.85/hr
RTX 509032 GB2101792$0.89/hr
V100 32GB32 GB28.3900$0.19/hr
Instinct MI10032 GB184.61229$0.40/hr