Updated minutes ago· Sources: GPU Pricing, API Token Pricing, Model Registry

Instinct MI300X

amd · cdna3 · 192 GB HBM3 · 750W TDP

VRAM

192 GB

BF16 TFLOPS

1307

Bandwidth

5300 GB/s

From

$1.79/hr

Calculate ROI with this GPU →

Spec Sheet

VRAM192 GB HBM3

Memory Bandwidth5300 GB/s

BF16 TFLOPS1307

FP16 TFLOPS1307

FP8 TFLOPS2614

INT8 TOPS2614

TDP750W

InterconnectINFINITY-FABRIC

Max per Node8

PCIe Gen5

Tensor CoresNo

Pricing by Provider

Provider	On-Demand	Reserved	Spot	Badge
fluidstack	$2.39/hr	-	$1.79/hr	Cheapest
vast_ai	$2.79/hr	-	$1.99/hr
tensordock	$2.69/hr	-	$1.99/hr
lambda	$2.49/hr	-	-
coreweave	$3.39/hr	$2.49/hr	-
runpod	$3.49/hr	-	$2.59/hr

Compatible Models (251)

Multi-GPU (25 models)

Grok-2x2 FP8 DeepSeek Coder V2 236Bx2 FP8 DeepSeek V2.5x2 FP8 Qwen 3 235Bx2 FP8 Falcon 180Bx2 FP8 Command Ax2 BF16 Inflection 3x2 BF16 Snowflake Arctic 480Bx3 FP8 Llama 3.1 405Bx3 FP8 Llama 4 Maverickx3 FP8 Jamba 1.5 Largex3 FP8 Snowflake Arctic 128x3Bx3 FP8 Nemotron 340Bx3 FP8 Claude Opus 4x3 BF16 GPT-4ox3 BF16+10 more

Training Capabilities

Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA

Model Size	Full Fine-Tune	QLoRA
7B model	1 GPU	1 GPU
13B model	2 GPUs	1 GPU
70B model	7 GPUs	1 GPU

Train on this GPU →

Energy Efficiency

Estimated tokens/second per Watt for popular models

Mistral 7B

0.97 t/s/WFP8

Qwen 2.5 7B

0.93 t/s/WFP8

Llama 3.1 8B

0.88 t/s/WFP8

DeepSeek V3

0.19 t/s/WFP8

Llama 3.1 70B

0.10 t/s/WFP8

Qwen 2.5 72B