H200 SXM vs A100 80GB SXM
Side-by-side comparison of the NVIDIA H200 SXM and the NVIDIA A100 80GB SXM for AI inference workloads.
Specifications
| Spec | H200 SXM | A100 80GB SXM |
|---|---|---|
| Generation | Hopper | Ampere |
| Memory Type | HBM3e | HBM2e |
| VRAM | 141 GB | 80 GB |
| Memory Bandwidth | 4,800 GB/s | 2,039 GB/s |
| BF16 TFLOPS | 990 | 312 |
| FP16 TFLOPS | 990 | 312 |
| FP8 TFLOPS | 1,979 | Not supported |
| INT8 TOPS | 1,979 | 624 |
| TDP | 700 W | 400 W |
| Interconnect | NVLink | NVLink |
| NVLink Bandwidth | 900 GB/s | 600 GB/s |
| Max GPUs per Node | 8 | 8 |
| PCIe Gen | Gen 5 | Gen 4 |
| CUDA Compute Capability | 9.0 | 8.0 |
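The summary at the end quotes a few derived deltas (76% more VRAM, and so on). Below is a minimal sketch, using only values hard-coded from the table above, of how those ratios fall out; the `SPECS` dict and `pct_more` helper are illustrative names, not from any provider API.

```python
# Derive the headline spec ratios from the comparison table above.
SPECS = {
    "H200 SXM":      {"vram_gb": 141, "bw_gbps": 4800, "bf16_tflops": 990, "tdp_w": 700},
    "A100 80GB SXM": {"vram_gb": 80,  "bw_gbps": 2039, "bf16_tflops": 312, "tdp_w": 400},
}

def pct_more(a: float, b: float) -> float:
    """How much larger a is than b, as a percentage."""
    return (a / b - 1) * 100

h200, a100 = SPECS["H200 SXM"], SPECS["A100 80GB SXM"]
print(f"VRAM:       +{pct_more(h200['vram_gb'], a100['vram_gb']):.0f}%")          # ~+76%
print(f"Bandwidth:  +{pct_more(h200['bw_gbps'], a100['bw_gbps']):.0f}%")          # ~+135%
print(f"BF16 FLOPS: +{pct_more(h200['bf16_tflops'], a100['bf16_tflops']):.0f}%")  # ~+217%
```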
Pricing
H200 SXM
| Provider | On-Demand | Reserved | Spot |
|---|---|---|---|
| Lambda | $3.49/hr | $2.69/hr | - |
| CoreWeave | $4.25/hr | $3.19/hr | - |
| RunPod | $4.69/hr | - | - |
| TensorDock | $3.80/hr | - | $2.90/hr |
A100 80GB SXM
| Provider | On-Demand | Reserved | Spot |
|---|---|---|---|
| RunPod | $2.72/hr | - | $2.09/hr |
| Lambda | $1.99/hr | $1.49/hr | - |
| CoreWeave | $2.21/hr | $1.62/hr | - |
| AWS | $3.67/hr | $2.39/hr | - |
| GCP | $3.67/hr | $2.48/hr | - |
| Azure | $3.67/hr | $2.45/hr | - |
| Vast.ai | $1.80/hr | - | $1.30/hr |
| TensorDock | $1.79/hr | - | $1.29/hr |
| FluidStack | $1.69/hr | - | $1.19/hr |
Cheapest available rate: $2.69/hr for the H200 SXM vs $1.19/hr for the A100 80GB SXM, making the A100 80GB SXM roughly 56% cheaper (equivalently, the H200 SXM's floor price is about 126% higher).
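A minimal sketch of how the cheapest-rate comparison can be computed from the two pricing tables; the rate dictionaries simply hard-code the numbers above, and the two print statements show why the same gap reads as roughly 126% in one direction and 56% in the other.

```python
# Cheapest rate per GPU across all listed providers and pricing tiers.
H200_RATES = {"Lambda": [3.49, 2.69], "CoreWeave": [4.25, 3.19],
              "RunPod": [4.69], "TensorDock": [3.80, 2.90]}
A100_RATES = {"RunPod": [2.72, 2.09], "Lambda": [1.99, 1.49], "CoreWeave": [2.21, 1.62],
              "AWS": [3.67, 2.39], "GCP": [3.67, 2.48], "Azure": [3.67, 2.45],
              "Vast.ai": [1.80, 1.30], "TensorDock": [1.79, 1.29], "FluidStack": [1.69, 1.19]}

def cheapest(rates: dict) -> float:
    return min(rate for tiers in rates.values() for rate in tiers)

h200_min, a100_min = cheapest(H200_RATES), cheapest(A100_RATES)        # 2.69, 1.19
print(f"H200 SXM floor is {(h200_min / a100_min - 1) * 100:.0f}% higher")   # ~126%
print(f"A100 80GB SXM is {(1 - a100_min / h200_min) * 100:.0f}% cheaper")   # ~56%
```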
Efficiency Metrics
| Metric | H200 SXM | A100 80GB SXM |
|---|---|---|
| TFLOPS / Watt (BF16) | 1.4 | 0.8 |
| VRAM / Dollar (GB per $/hr) | 52.4 | 67.2 |
| Bandwidth / Watt (GB/s per W) | 6.9 | 5.1 |
| Models supported (FP16, 1 GPU) | 193 | 182 |
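A minimal sketch (an assumed methodology, not necessarily how these figures were produced) that reproduces the first three efficiency ratios from the spec table and the cheapest hourly rates:

```python
# Efficiency ratios derived from specs and cheapest on-demand/reserved/spot pricing.
GPUS = {
    "H200 SXM":      {"vram_gb": 141, "bw_gbps": 4800, "bf16_tflops": 990, "tdp_w": 700, "cheapest_hr": 2.69},
    "A100 80GB SXM": {"vram_gb": 80,  "bw_gbps": 2039, "bf16_tflops": 312, "tdp_w": 400, "cheapest_hr": 1.19},
}

for name, g in GPUS.items():
    tflops_per_watt = g["bf16_tflops"] / g["tdp_w"]     # 1.4 vs 0.8
    vram_per_dollar = g["vram_gb"] / g["cheapest_hr"]   # 52.4 vs 67.2
    bw_per_watt     = g["bw_gbps"] / g["tdp_w"]         # 6.9 vs 5.1
    print(f"{name}: {tflops_per_watt:.1f} TFLOPS/W, "
          f"{vram_per_dollar:.1f} GB per $/hr, {bw_per_watt:.1f} GB/s per W")
```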
Model Compatibility (FP16, Single GPU)
Only on H200 SXM (11)
- Jamba 1.5 Mini
- Amazon Nova Pro
- Falcon 40B
- Mixtral 8x7B
- Llama 3.1 Nemotron 51B
- VILA 1.5 40B
- Gemini 2.0 Flash
- Gemini 1.5 Flash
- Jamba Instruct
- Phi 3.5 MoE
- Mixtral 8x7B Instruct
Both (182)
- Yi 1.5 34B
- Yi 1.5 9B
- Yi Coder 9B
- GTE Qwen2 7B
- Marco O1
- Qwen 1.5 MoE A2.7B
- Qwen 2 Audio 7B
- Qwen 2.5 14B
- Qwen 2.5 32B
- Qwen 2.5 3B
- Qwen 2.5 Coder 32B
- OLMo 2 13B
- OLMo 2 7B
- Amazon Nova Lite
- OpenELM 3B
- BGE Large EN v1.5
- BGE M3
- Baichuan 2 13B
- OctoCoder 15B
- StarCoder2 15B
- +162 more
Only on A100 80GB SXM (0)
None
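A minimal sketch of a common single-GPU fit heuristic, assuming FP16 weights at 2 bytes per parameter plus roughly 10% headroom for KV cache and activations. This is an illustrative rule of thumb, not necessarily the criterion used to build the lists above.

```python
# Rough check: do a model's FP16 weights (plus overhead) fit in a single GPU's VRAM?
def fits_fp16(params_billion: float, vram_gb: float, overhead_frac: float = 0.10) -> bool:
    weights_gb = params_billion * 2          # 2 bytes per parameter at FP16
    return weights_gb * (1 + overhead_frac) <= vram_gb

# Example: Mixtral 8x7B has ~46.7B total parameters, so FP16 weights alone are ~93 GB.
print(fits_fp16(46.7, vram_gb=141))  # True  -> fits on H200 SXM
print(fits_fp16(46.7, vram_gb=80))   # False -> does not fit on A100 80GB SXM
```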
Summary
The H200 SXM (Hopper generation) offers 141 GB of HBM3e with 990 BF16 TFLOPS and 4,800 GB/s of memory bandwidth at 700 W TDP.
The A100 80GB SXM (Ampere generation) offers 80 GB of HBM2e with 312 BF16 TFLOPS and 2,039 GB/s of memory bandwidth at 400 W TDP.
The H200 SXM has about 76% more VRAM (141 GB vs 80 GB), allowing it to run larger models without multi-GPU setups.
From a cost perspective, the A100 80GB SXM is more affordable at $1.19/hr vs $2.69/hr for the H200 SXM.