AI21

Jamba 1.5 Large

AI21 · hybrid · 398B parameters · 256,000 context

Quality
50.0

Architecture Details

Type: HYBRID
Total Parameters: 398B
Active Parameters: 52B
Layers: 64
Hidden Dimension: 8,192
Attention Heads: 64
KV Heads: 8
Head Dimension: 128
Vocab Size: 65,536
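
Several of the figures above are related by simple arithmetic. The sketch below recomputes the head dimension from the hidden dimension and head count, the number of query heads sharing each KV head, and the share of parameters active per token. The variable names are illustrative and not taken from this page.

```python
# Figures copied from the Architecture Details list above.
HIDDEN_DIM = 8192
ATTENTION_HEADS = 64
KV_HEADS = 8
TOTAL_PARAMS_B = 398   # billions
ACTIVE_PARAMS_B = 52   # billions

# Head dimension is the hidden dimension split evenly across attention heads.
head_dim = HIDDEN_DIM // ATTENTION_HEADS

# With grouped-query attention, several query heads share one KV head.
queries_per_kv_head = ATTENTION_HEADS // KV_HEADS

# Fraction of the total parameters that are active for a single token.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

print(head_dim, queries_per_kv_head, f"{active_fraction:.1%}")
```

The computed head dimension agrees with the listed value of 128, and roughly 13% of the weights are active per token.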

Memory Requirements

BF16 Weights

796.0 GB

FP8 Weights

398.0 GB

INT4 Weights

199.0 GB
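
The three weight figures are consistent with parameter count times bytes per parameter. A minimal sketch, assuming 1 GB = 10^9 bytes and ignoring any overhead such as quantization scales (the page does not state its exact method):

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"BF16": 2.0, "FP8": 1.0, "INT4": 0.5}

def weight_memory_gb(total_params_billions: float, precision: str) -> float:
    """Weight footprint in decimal GB: billions of params x bytes per param."""
    return total_params_billions * BYTES_PER_PARAM[precision]

for precision in ("BF16", "FP8", "INT4"):
    print(precision, weight_memory_gb(398, precision), "GB")
```

Note that all 398B parameters must be resident in memory even though only 52B are active per token.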

KV-Cache per Token: 131,072 bytes
Activation Estimate: 3.00 GB
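
The page does not say how the per-token KV-cache figure is derived. The sketch below uses the usual formula (keys plus values, per KV head, per attention layer) and shows two parameter choices that happen to reproduce 131,072 bytes. Both choices are assumptions, as is the helper name; which one the page actually uses is unknown.

```python
def kv_bytes_per_token(kv_heads: int, head_dim: int,
                       attention_layers: int, bytes_per_element: int) -> int:
    """Bytes of KV cache added per token: 2 tensors (K and V) per layer."""
    return 2 * kv_heads * head_dim * attention_layers * bytes_per_element

# Assumption A: 32 of the layers keep a KV cache, stored at 2 bytes (BF16).
option_a = kv_bytes_per_token(kv_heads=8, head_dim=128,
                              attention_layers=32, bytes_per_element=2)

# Assumption B: all 64 layers keep a KV cache, stored at 1 byte (FP8).
option_b = kv_bytes_per_token(kv_heads=8, head_dim=128,
                              attention_layers=64, bytes_per_element=1)

# KV cache for one sequence at the full 256,000-token context, in decimal GB.
full_context_gb = 131072 * 256_000 / 1e9

print(option_a, option_b, round(full_context_gb, 2))
```

At the listed rate, a single full-length sequence needs about 33.55 GB of KV cache on top of the weights.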

Fits on (single-node)

Instinct MI325X · INT4
B200 NVL (pair) · INT4
B300 · INT4
B200 SXM x2 · INT4
B100 SXM x2 · INT4
GB200 NVL72 (per GPU) x2 · INT4
GB300 NVL72 (per GPU) x2 · INT4
H200 SXM x2 · INT4
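
A rough way to sanity-check entries in this list is to divide the INT4 weights plus the activation estimate by per-GPU memory. This is only a sketch: the page's own fit rule is not documented, KV cache is left out, and the GPU memory capacities below are assumptions that do not come from this page.

```python
import math

INT4_WEIGHTS_GB = 199.0   # from Memory Requirements above
ACTIVATION_GB = 3.00      # from Memory Requirements above

# Assumed per-GPU memory in GB (not listed on this page).
ASSUMED_GPU_MEMORY_GB = {"Instinct MI325X": 256, "H200 SXM": 141}

def gpus_needed(weights_gb: float, activation_gb: float,
                gpu_memory_gb: float) -> int:
    """Smallest GPU count whose combined memory holds weights + activations."""
    return math.ceil((weights_gb + activation_gb) / gpu_memory_gb)

for name, memory_gb in ASSUMED_GPU_MEMORY_GB.items():
    print(name, gpus_needed(INT4_WEIGHTS_GB, ACTIVATION_GB, memory_gb))
```

Under these assumptions the result matches the list: one MI325X, two H200 SXM. Real deployments also need headroom for the KV cache and runtime overhead.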

GPU Recommendations

B200 NVL (pair) · optimal
FP8 · 2 GPUs · tensorrt-llm
Score: 100/100
Throughput: 280.0 tok/s
Cost/Month: $19,929
Cost/M Tokens: $27.08

B200 SXM · optimal
FP8 · 4 GPUs · tensorrt-llm
Score: 98/100
Throughput: 280.0 tok/s
Cost/Month: $17,044
Cost/M Tokens: $23.16

B100 SXM · optimal
FP8 · 4 GPUs · tensorrt-llm
Score: 98/100
Throughput: 280.0 tok/s
Cost/Month: $17,082
Cost/M Tokens: $23.21

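
The Cost/M Tokens figures above can be reproduced from Cost/Month and Throughput if the GPUs are assumed to run at the listed throughput for 730 hours per month. That hour count is an inference from the numbers, not something the page states, and the function name is illustrative.

```python
HOURS_PER_MONTH = 730  # assumption: 8,760 hours per year / 12

def cost_per_million_tokens(cost_per_month: float, tokens_per_second: float,
                            hours_per_month: float = HOURS_PER_MONTH) -> float:
    """Monthly cost divided by millions of tokens generated in that month."""
    tokens_per_month = tokens_per_second * 3600 * hours_per_month
    return cost_per_month / (tokens_per_month / 1_000_000)

# (name, Cost/Month in USD, Throughput in tok/s) from the cards above.
configs = [
    ("B200 NVL (pair)", 19929, 280.0),
    ("B200 SXM", 17044, 280.0),
    ("B100 SXM", 17082, 280.0),
]
for name, monthly_usd, tok_per_s in configs:
    print(name, round(cost_per_million_tokens(monthly_usd, tok_per_s), 2))
```

These figures assume full utilization around the clock; at lower utilization the effective cost per million tokens rises proportionally.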

API Pricing Comparison

Provider · Input $/M · Output $/M · Badges
ai21 · $2.00 · $8.00 · Cheapest
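
To turn the per-million prices into a per-request cost, weight input and output tokens separately. A minimal sketch using the ai21 prices listed above; the function name and the example token counts are illustrative.

```python
INPUT_USD_PER_M = 2.00    # ai21 input price per million tokens
OUTPUT_USD_PER_M = 8.00   # ai21 output price per million tokens

def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost of one API request at the listed per-million-token prices."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000

# Example: a 10,000-token prompt with a 1,000-token completion.
print(request_cost_usd(10_000, 1_000))
```

Because output tokens cost four times as much as input tokens, long completions dominate the bill even when prompts are large.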

Capabilities

Features

Tool Use · Vision · Code · Math · Reasoning · Multilingual · Structured Output

Supported Frameworks

vllm · sglang

Supported Precisions

BF16 (default) · FP8

Similar Models