Updated minutes ago
Jamba 1.5 Large
AI21 · hybrid · 398B parameters · 256,000 context
Quality50.0
Architecture Details
TypeHYBRID
Total Parameters398B
Active Parameters52B
Layers64
Hidden Dimension8,192
Attention Heads64
KV Heads8
Head Dimension128
Vocab Size65,536
Memory Requirements
BF16 Weights
796.0 GB
FP8 Weights
398.0 GB
INT4 Weights
199.0 GB
KV-Cache per Token131072 bytes
Activation Estimate3.00 GB
Fits on (single-node)
Instinct MI325X INT4B200 NVL (pair) INT4B300 INT4B200 SXMx2 INT4B100 SXMx2 INT4GB200 NVL72 (per GPU)x2 INT4GB300 NVL72 (per GPU)x2 INT4H200 SXMx2 INT4
GPU Recommendations
B200 NVL (pair)optimal
FP8 · 2 GPUs · tensorrt-llm
100/100
score
Throughput
280.0 tok/s
Cost/Month
$19929
Cost/M Tokens
$27.08
B200 SXMoptimal
FP8 · 4 GPUs · tensorrt-llm
98/100
score
Throughput
280.0 tok/s
Cost/Month
$17044
Cost/M Tokens
$23.16
B100 SXMoptimal
FP8 · 4 GPUs · tensorrt-llm
98/100
score
Throughput
280.0 tok/s
Cost/Month
$17082
Cost/M Tokens
$23.21
API Pricing Comparison
| Provider | Input $/M | Output $/M | Badges |
|---|---|---|---|
| ai21 | $2.00 | $8.00 | Cheapest |
Capabilities
Features
✓ Tool Use✗ Vision✓ Code✓ Math✗ Reasoning✓ Multilingual✓ Structured Output
Supported Frameworks
vllmsglang
Supported Precisions
BF16 (default)FP8