Snowflake Arctic 480B
Snowflake · MoE · 480B parameters · 4,096-token context
Quality: 50.0
Architecture Details
Type: MoE
Total Parameters: 480B
Active Parameters: 17B
Layers: 35
Hidden Dimension: 7,168
Attention Heads: 56
KV Heads: 8
Head Dimension: 128
Vocab Size: 32,000
Total Experts: 128
Active Experts: 2
Memory Requirements
BF16 Weights: 960.0 GB
FP8 Weights: 480.0 GB
INT4 Weights: 240.0 GB
KV-Cache per Token: 143,360 bytes
Activation Estimate: 2.00 GB
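The weight and KV-cache figures above can be reproduced from the architecture details with simple arithmetic. The sketch below is an illustration, not the page's own calculator: it assumes 1 GB = 10^9 bytes, ignores quantization overhead such as scales and zero points, and assumes the KV cache is stored at 2 bytes per value (BF16). The function names are hypothetical.

```python
def weight_bytes(total_params: float, bits_per_param: int) -> float:
    """Raw weight storage, ignoring quantization metadata (assumption)."""
    return total_params * bits_per_param / 8


def kv_cache_bytes_per_token(layers: int, kv_heads: int, head_dim: int,
                             bytes_per_value: int = 2) -> int:
    """Per-token KV cache: one key and one value vector per layer."""
    return 2 * layers * kv_heads * head_dim * bytes_per_value


TOTAL_PARAMS = 480e9  # from "Total Parameters: 480B"

for name, bits in [("BF16", 16), ("FP8", 8), ("INT4", 4)]:
    print(f"{name} weights: {weight_bytes(TOTAL_PARAMS, bits) / 1e9:.1f} GB")

# Layers, KV heads, and head dimension are taken from Architecture Details.
per_token = kv_cache_bytes_per_token(layers=35, kv_heads=8, head_dim=128)
print(f"KV cache per token: {per_token} bytes")

# KV cache for one sequence at the full 4,096-token context.
print(f"KV cache at 4,096 tokens: {per_token * 4096 / 1e9:.2f} GB")
```

Under these assumptions the per-token figure comes out to 143,360 bytes, matching the listed value, and a single full-length sequence needs roughly 0.59 GB of KV cache on top of the weights.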
Fits on (single-node)
B200 NVL (pair): INT4
B300: INT4
B200 SXM x2: INT4
B100 SXM x2: INT4
GB200 NVL72 (per GPU) x2: INT4
GB300 NVL72 (per GPU) x2: INT4
H100 NVL 94GB (per GPU pair) x2: INT4
Instinct MI300X x2: INT4
GPU Recommendations
| GPU | Rating | Configuration | Score | Throughput | Cost/Month | Cost/M Tokens |
|---|---|---|---|---|---|---|
| B200 SXM | optimal | FP8 · 4 GPUs · tensorrt-llm | 100/100 | 280.0 tok/s | $17,044 | $23.16 |
| B100 SXM | optimal | FP8 · 4 GPUs · tensorrt-llm | 100/100 | 280.0 tok/s | $17,082 | $23.21 |
| B200 NVL (pair) | optimal | FP8 · 2 GPUs · tensorrt-llm | 100/100 | 280.0 tok/s | $19,929 | $27.08 |
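The Cost/M Tokens values in the GPU recommendations follow from the monthly cost and the throughput. The sketch below is one way to derive them; it assumes a 730-hour month (8,760 hours / 12) and continuous operation at the listed throughput, which is an inference that happens to reproduce the listed figures rather than a documented formula. The function name is hypothetical.

```python
HOURS_PER_MONTH = 730  # assumption: 8,760 hours per year / 12 months


def cost_per_million_tokens(monthly_cost_usd: float,
                            tokens_per_second: float,
                            hours_per_month: float = HOURS_PER_MONTH) -> float:
    """Cost per 1M tokens, assuming 100% utilization at the given throughput."""
    tokens_per_month = tokens_per_second * 3600 * hours_per_month
    return monthly_cost_usd / tokens_per_month * 1e6


# Monthly costs and throughput are taken from the recommendations above.
recommendations = [
    ("B200 SXM", 17044),
    ("B100 SXM", 17082),
    ("B200 NVL (pair)", 19929),
]

for gpu, monthly_cost in recommendations:
    cost = cost_per_million_tokens(monthly_cost, tokens_per_second=280.0)
    print(f"{gpu}: ${cost:.2f} per million tokens")
```

Because the calculation assumes full utilization, the real cost per token would be higher for any deployment that sits idle part of the time.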
API Pricing Comparison
| Provider | Input $/M | Output $/M | Badges |
|---|---|---|---|
| snowflake | $1.50 | $1.50 | Cheapest |
Capabilities
Features
✗ Tool Use
✗ Vision
✓ Code
✗ Math
✗ Reasoning
✗ Multilingual
✓ Structured Output
Supported Frameworks
vllm, sglang
Supported Precisions
BF16 (default), FP8, INT4