Updated minutes ago· Sources: GPU Pricing, API Token Pricing, Model Registry

Gaudi 3

intel · gaudi · 128 GB HBM2e · 900W TDP

VRAM

128 GB

BF16 TFLOPS

1835

Bandwidth

3700 GB/s

From

$2.30/hr

Calculate ROI with this GPU →

Spec Sheet

VRAM128 GB HBM2e

Memory Bandwidth3700 GB/s

BF16 TFLOPS1835

FP16 TFLOPS1835

FP8 TFLOPS3670

INT8 TOPS3670

TDP900W

InterconnectOAM

Max per Node8

PCIe Gen5

Tensor CoresNo

Pricing by Provider

Provider	On-Demand	Reserved	Spot	Badge
gcp	$3.20/hr	$2.30/hr	-	Cheapest
aws	$3.78/hr	$2.65/hr	-

Compatible Models (249)

Multi-GPU (34 models)

Falcon 180Bx2 FP8 Mixtral 8x22Bx2 FP8 DBRX Basex2 FP8 DBRX Instructx2 FP8 Mistral Large 2411x2 FP8 Mistral Large 2x2 FP8 Llama 4 Scoutx2 FP8 Inflection 3x2 BF16 Claude Sonnet 4x2 BF16 o1-minix2 BF16 o3-minix2 BF16 Claude 3 Sonnetx2 BF16 Reka Corex2 BF16 Grok-2x3 FP8 DeepSeek Coder V2 236Bx3 FP8+19 more

Training Capabilities

Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA

Model Size	Full Fine-Tune	QLoRA
7B model	2 GPUs	1 GPU
13B model	2 GPUs	1 GPU
70B model	11 GPUs	1 GPU

Train on this GPU →

Energy Efficiency

Estimated tokens/second per Watt for popular models

Mistral 7B

0.56 t/s/WFP8

Qwen 2.5 7B

0.54 t/s/WFP8

Llama 3.1 8B

0.51 t/s/WFP8

DeepSeek V3

0.11 t/s/WFP8

Llama 3.1 70B

0.06 t/s/WFP8

Qwen 2.5 72B