Claude Opus 4
Anthropic · dense · 200B parameters · 200,000 context
Quality: 94.0
Architecture Details
Type: Dense
Total Parameters: 200B
Active Parameters: 200B
Layers: 96
Hidden Dimension: 12,288
Attention Heads: 96
KV Heads: 16
Head Dimension: 128
Vocab Size: 152,064
Memory Requirements
BF16 Weights: 400.0 GB
FP8 Weights: 200.0 GB
INT4 Weights: 100.0 GB
KV-Cache per Token: 393,216 bytes
Activation Estimate: 5.00 GB
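The weight and KV-cache figures above can be reproduced from the architecture numbers. The following is a rough sketch of that arithmetic, not the page's own calculator: it assumes 1 GB = 10^9 bytes and the usual grouped-query KV-cache formula (2 × layers × KV heads × head dimension × bytes per element), and the function names are illustrative. Under that formula, the listed 393,216 bytes corresponds to one byte per cached element; at two bytes per element (BF16) the same formula gives 786,432 bytes.

```python
# Sketch: reproduce the memory figures from the architecture numbers.
# Assumptions (not stated on the page): 1 GB = 1e9 bytes, and the
# KV cache stores one key and one value vector per KV head per layer.

PARAMS = 200e9       # total parameters
LAYERS = 96
KV_HEADS = 16
HEAD_DIM = 128
CONTEXT = 200_000    # maximum context length in tokens

BYTES_PER_PARAM = {"BF16": 2.0, "FP8": 1.0, "INT4": 0.5}


def weight_gb(precision: str) -> float:
    """Weight footprint in GB at the given precision."""
    return PARAMS * BYTES_PER_PARAM[precision] / 1e9


def kv_bytes_per_token(bytes_per_element: int) -> int:
    """Bytes of keys plus values cached for one token across all layers."""
    return 2 * LAYERS * KV_HEADS * HEAD_DIM * bytes_per_element


for precision in BYTES_PER_PARAM:
    print(f"{precision} weights: {weight_gb(precision):.1f} GB")

print("KV bytes/token at 1 byte/element:", kv_bytes_per_token(1))
print("KV bytes/token at 2 bytes/element:", kv_bytes_per_token(2))

# KV cache for a single sequence at the full context length.
full_context_gb = kv_bytes_per_token(1) * CONTEXT / 1e9
print(f"KV cache at {CONTEXT:,} tokens: {full_context_gb:.1f} GB")
```

At the full 200,000-token context, the listed per-token figure implies roughly 78.6 GB of KV cache per sequence, which is worth budgeting alongside the weights when choosing a precision.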
Fits on (single-node)
B200 SXM: INT4
B100 SXM: INT4
GB200 NVL72 (per GPU): INT4
GB300 NVL72 (per GPU): INT4
H200 SXM: INT4
H100 NVL 94GB (per GPU pair): INT4
Instinct MI300X: INT4
Instinct MI325X: FP8
GPU Recommendations
| GPU | Rating | Precision | GPUs | Framework | Score | Throughput | Cost/Month | Cost/M Tokens |
|---|---|---|---|---|---|---|---|---|
| B200 NVL (pair) | optimal | BF16 | 2 | tensorrt-llm | 83/100 | 280.0 tok/s | $19,929 | $27.08 |
| H20 | optimal | BF16 | 8 | tensorrt-llm | 80/100 | 280.0 tok/s | $7,516 | $10.21 |
| B200 SXM | good | BF16 | 4 | tensorrt-llm | 78/100 | 280.0 tok/s | $17,044 | $23.16 |
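The Cost/M Tokens figures above are consistent with dividing the monthly cost by the tokens generated in a 730-hour month at the listed throughput, with the hardware fully utilized. The page does not state this formula, so the sketch below is an inference from the numbers; `HOURS_PER_MONTH` and the function name are assumptions.

```python
# Sketch: derive cost per million tokens from monthly cost and throughput.
# Assumption (inferred, not stated on the page): a 730-hour month
# at 100% utilization.

HOURS_PER_MONTH = 730


def cost_per_million_tokens(cost_per_month: float, tokens_per_second: float) -> float:
    """Monthly cost divided by millions of tokens produced in a month."""
    tokens_per_month = tokens_per_second * HOURS_PER_MONTH * 3600
    return cost_per_month / (tokens_per_month / 1e6)


# Monthly costs (USD) as listed for each recommended configuration.
monthly_costs = {
    "B200 NVL (pair)": 19929,
    "H20": 7516,
    "B200 SXM": 17044,
}

for gpu, monthly in monthly_costs.items():
    print(f"{gpu}: ${cost_per_million_tokens(monthly, 280.0):.2f} per M tokens")
```

Because all three configurations list the same 280.0 tok/s, the per-token cost differences come entirely from the monthly hardware cost. Real utilization below 100% would raise the effective cost proportionally.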
API Pricing Comparison
| Provider | Input ($/M tokens) | Output ($/M tokens) | Badges |
|---|---|---|---|
| Anthropic | $15.00 | $75.00 | Cheapest |
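As a quick illustration of how the per-million-token prices translate into a per-request cost, here is a minimal sketch. The token counts in the example call are hypothetical, and the function name is illustrative rather than part of any API.

```python
# Sketch: cost of one API request at the listed per-million-token prices.

INPUT_USD_PER_M = 15.00    # listed input price, USD per million tokens
OUTPUT_USD_PER_M = 75.00   # listed output price, USD per million tokens


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for a single request."""
    return (input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M) / 1e6


# Hypothetical request: 10,000 input tokens and 1,000 output tokens.
example_cost = request_cost_usd(10_000, 1_000)
print(f"${example_cost:.3f}")
```

Output tokens are priced at five times the input rate, so generation-heavy workloads dominate the bill even when prompts are long.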
Quality Benchmarks
MMLU: 91.5
HumanEval: 75.0
GSM8K: 97.0
MT-Bench: 92.0
Capabilities
Features
✓ Tool Use
✓ Vision
✓ Code
✓ Math
✓ Reasoning
✓ Multilingual
✓ Structured Output
Supported Frameworks
Supported Precisions
BF16 (default)