Instinct MI250X
AMD · CDNA 2 · 128 GB HBM2e · 560 W TDP
VRAM: 128 GB · BF16: 383 TFLOPS · Bandwidth: 3277 GB/s · From: $0.79/hr
Spec Sheet
| Spec | Value |
|---|---|
| VRAM | 128 GB HBM2e |
| Memory Bandwidth | 3277 GB/s |
| BF16 TFLOPS | 383 |
| FP16 TFLOPS | 383 |
| FP8 TFLOPS | 383 |
| INT8 TOPS | 383 |
| TDP | 560 W |
| Interconnect | Infinity Fabric |
| Max per Node | 8 |
| PCIe | Gen4 |
| Tensor Cores | No (AMD's equivalent is Matrix Cores) |
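A quick sanity check on these numbers: dividing peak BF16 compute by memory bandwidth gives the arithmetic intensity (FLOPs per byte streamed from HBM2e) a kernel must exceed before it becomes compute-bound rather than memory-bound. A minimal sketch using the spec-sheet figures above:

```python
# Back-of-envelope roofline break-even point for the MI250X,
# using the BF16 TFLOPS and bandwidth figures from the spec sheet.
BF16_TFLOPS = 383
BANDWIDTH_GBPS = 3277

# FLOPs available per byte of HBM2e traffic.
flops_per_byte = (BF16_TFLOPS * 1e12) / (BANDWIDTH_GBPS * 1e9)
print(f"{flops_per_byte:.0f} FLOPs/byte")  # ≈ 117
```

Memory-bandwidth-heavy workloads such as batch-1 LLM decoding sit far below ~117 FLOPs/byte, which is why the bandwidth column matters more than peak TFLOPS for inference.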
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| fluidstack | $1.09/hr | - | $0.79/hr | Cheapest |
| vast_ai | $1.20/hr | - | $0.85/hr | - |
Compatible Models (249)
Single GPU (215 models)
| Model | Params | Precision |
|---|---|---|
| Command R+ | 104B | FP8 |
| Yi-Large | 102.6B | FP8 |
| YaLM 100B | 100B | FP8 |
| Llama 3.2 90B Vision | 90B | FP8 |
| Llama 3.2 90B Vision Instruct | 88.8B | FP8 |
| Qwen 2.5 72B | 72.7B | FP8 |
| Qwen 2.5 Math 72B | 72.7B | FP8 |
| Qwen 2.5 VL 72B | 72.7B | FP8 |
| Dolphin 2.9 72B | 72B | FP8 |
| DeepSeek R1 Distill 70B | 70.6B | FP8 |
| Llama 3 70B 1M Context | 70.6B | FP8 |
| Llama 3 70B | 70.6B | FP8 |
| Llama 3.1 70B | 70.6B | FP8 |
| Llama 3.3 70B | 70.6B | FP8 |
| Hermes 3 70B | 70.6B | FP8 |
| HelpSteer2 Llama 3.1 70B | 70.6B | FP8 |
| Llama 3.1 Nemotron 70B Instruct | 70.6B | FP8 |
| Llama 3.1 Nemotron 70B Reward | 70.6B | FP8 |
| Nemotron 70B | 70.6B | FP8 |
| Llama 3.1 70B Turbo | 70.6B | FP8 |

+195 more
Multi-GPU (34 models)
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 2 GPUs | 1 GPU |
| 13B model | 2 GPUs | 1 GPU |
| 70B model | 11 GPUs | 1 GPU |
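The table above follows from a simple capacity calculation: total memory footprint (weights, gradients, optimizer states, plus headroom) divided by 128 GB of VRAM per GPU, rounded up. The per-parameter byte counts below are assumptions, not published figures; ~19 bytes/param for full AdamW BF16 fine-tuning and ~1.5 bytes/param for 4-bit QLoRA happen to reproduce the table:

```python
import math

VRAM_GB = 128  # MI250X VRAM per GPU

def gpus_needed(params_b: float, bytes_per_param: float,
                vram_gb: float = VRAM_GB) -> int:
    """Minimum GPU count so the training footprint fits in aggregate VRAM."""
    return math.ceil(params_b * bytes_per_param / vram_gb)

# Assumed calibrations: ~19 B/param covers BF16 weights + grads + AdamW
# states + activation headroom; ~1.5 B/param covers 4-bit weights + adapters.
for size_b in (7, 13, 70):
    print(f"{size_b}B: full={gpus_needed(size_b, 19)} "
          f"qlora={gpus_needed(size_b, 1.5)}")
```

Note this counts only memory capacity; it ignores per-GPU activation scaling, sequence length, and batch size, so treat it as a lower bound.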
Energy Efficiency
Estimated tokens/second per Watt for popular models
| Model | Efficiency | Precision |
|---|---|---|
| Mistral 7B | 0.80 t/s/W | FP8 |
| Qwen 2.5 7B | 0.77 t/s/W | FP8 |
| Llama 3.1 8B | 0.73 t/s/W | FP8 |
| DeepSeek V3 | 0.16 t/s/W | FP8 |
| Llama 3.1 70B | 0.08 t/s/W | FP8 |
| Qwen 2.5 72B | 0.08 t/s/W | FP8 |
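To turn these per-Watt figures into absolute throughput, multiply by board power. A minimal sketch, assuming the card draws its full 560 W TDP (actual draw during decoding is typically lower, so these are rough upper-bound estimates):

```python
TDP_W = 560  # MI250X TDP from the spec sheet

# tokens/second per Watt at FP8, taken from the efficiency table above
efficiency_tsw = {
    "Mistral 7B": 0.80,
    "Llama 3.1 70B": 0.08,
}

# Implied whole-board throughput if the card runs at full TDP.
for model, tsw in efficiency_tsw.items():
    print(f"{model}: ~{tsw * TDP_W:.0f} tok/s")
# Mistral 7B: ~448 tok/s, Llama 3.1 70B: ~45 tok/s
```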
Similar GPUs
| GPU | VRAM | BF16 TFLOPS | BW (GB/s) | From |
|---|---|---|---|---|
| Instinct MI210 | 64 GB | 181 | 1638 | $0.99/hr |
| Gaudi 3 | 128 GB | 1835 | 3700 | $2.30/hr |
| Gaudi 3 HL-325L | 128 GB | 1835 | 3700 | $2.65/hr |
| H200 SXM | 141 GB | 990 | 4800 | $2.69/hr |
| H20 | 96 GB | 148 | 4000 | $0.99/hr |