Updated minutes ago
Instinct MI300X
amd · cdna3 · 192 GB HBM3 · 750W TDP
VRAM
192 GB
BF16 TFLOPS
1307
Bandwidth
5300 GB/s
From
$1.79/hr
Spec Sheet
VRAM192 GB HBM3
Memory Bandwidth5300 GB/s
BF16 TFLOPS1307
FP16 TFLOPS1307
FP8 TFLOPS2614
INT8 TOPS2614
TDP750W
InterconnectINFINITY-FABRIC
Max per Node8
PCIe Gen5
Tensor CoresNo
Pricing by Provider
| Provider | On-Demand | Reserved | Spot | Badge |
|---|---|---|---|---|
| fluidstack | $2.39/hr | - | $1.79/hr | Cheapest |
| vast_ai | $2.79/hr | - | $1.99/hr | |
| tensordock | $2.69/hr | - | $1.99/hr | |
| lambda | $2.49/hr | - | - | |
| coreweave | $3.39/hr | $2.49/hr | - | |
| runpod | $3.49/hr | - | $2.59/hr |
Compatible Models (251)
Single GPU (226 models)
Mixtral 8x22B141B FP8DBRX Base132B FP8DBRX Instruct132B FP8Mistral Large 2411123B FP8Mistral Large 2123B FP8Llama 4 Scout109B FP8Command R+104B FP8Yi-Large102.6B FP8YaLM 100B100B FP8Llama 3.2 90B Vision90B FP8Llama 3.2 90B Vision Instruct88.8B FP8Qwen 2.5 72B72.7B FP8Qwen 2.5 Math 72B72.7B FP8Qwen 2.5 VL 72B72.7B FP8Dolphin 2.9 72B72B FP8DeepSeek R1 Distill 70B70.6B FP8Llama 3 70B 1M Context70.6B FP8Llama 3 70B70.6B FP8Llama 3.1 70B70.6B FP8Llama 3.3 70B70.6B FP8+206 more
Multi-GPU (25 models)
Grok-2x2 FP8DeepSeek Coder V2 236Bx2 FP8DeepSeek V2.5x2 FP8Qwen 3 235Bx2 FP8Falcon 180Bx2 FP8Command Ax2 BF16Inflection 3x2 BF16Snowflake Arctic 480Bx3 FP8Llama 3.1 405Bx3 FP8Llama 4 Maverickx3 FP8Jamba 1.5 Largex3 FP8Snowflake Arctic 128x3Bx3 FP8Nemotron 340Bx3 FP8Claude Opus 4x3 BF16GPT-4ox3 BF16+10 more
Training Capabilities
Estimated GPU count for full fine-tuning (AdamW, BF16) and QLoRA
| Model Size | Full Fine-Tune | QLoRA |
|---|---|---|
| 7B model | 1 GPU | 1 GPU |
| 13B model | 2 GPUs | 1 GPU |
| 70B model | 7 GPUs | 1 GPU |
Energy Efficiency
Estimated tokens/second per Watt for popular models
Mistral 7B
0.97 t/s/WFP8
Qwen 2.5 7B
0.93 t/s/WFP8
Llama 3.1 8B
0.88 t/s/WFP8
DeepSeek V3
0.19 t/s/WFP8
Llama 3.1 70B
0.10 t/s/WFP8
Qwen 2.5 72B
0.10 t/s/WFP8
Similar GPUs
| GPU | VRAM | BF16 TFLOPS | BW (GB/s) | From |
|---|---|---|---|---|
| Instinct MI325X | 256 GB | 1307 | 6000 | $2.49/hr |
| B100 SXM | 192 GB | 1750 | 8000 | $4.50/hr |
| GB200 NVL72 (per GPU) | 192 GB | 2250 | 8000 | $6.50/hr |
| GB300 NVL72 (per GPU) | 192 GB | 2500 | 8000 | $7.50/hr |
| H100 NVL 94GB (per GPU pair) | 188 GB | 1670 | 7876 | $5.49/hr |