
RTX A6000 vs L40S

Side-by-side comparison of the NVIDIA RTX A6000 and the NVIDIA L40S for AI inference workloads.

Specifications

Spec                       RTX A6000   L40S
Generation                 Ampere      Ada Lovelace
Memory Type                GDDR6       GDDR6
VRAM                       48 GB       48 GB
Memory Bandwidth           768 GB/s    864 GB/s
BF16 TFLOPS                39          362
FP16 TFLOPS                39          362
FP8 TFLOPS                 39          733
INT8 TOPS                  77          733
TDP                        300 W       350 W
Interconnect               PCIe        PCIe
Max GPUs per Node          8           8
PCIe Gen                   Gen 4       Gen 4
CUDA Compute Capability    8.6         8.9

Pricing

RTX A6000

Provider      On-Demand   Reserved    Spot
RunPod        $1.09/hr    -           $0.79/hr
Lambda        $0.99/hr    -           -
Vast.ai       $0.79/hr    -           $0.55/hr
TensorDock    $0.69/hr    -           $0.49/hr

L40S

Provider      On-Demand   Reserved    Spot
RunPod        $1.90/hr    -           $1.49/hr
Lambda        $1.59/hr    $1.19/hr    -
CoreWeave     $1.84/hr    $1.34/hr    -
AWS           $2.56/hr    $1.69/hr    -
GCP           $2.45/hr    $1.62/hr    -
Vast.ai       $1.29/hr    -           $0.95/hr
TensorDock    $1.19/hr    -           $0.89/hr
FluidStack    $1.09/hr    -           $0.85/hr

Cheapest available rate: RTX A6000 at $0.49/hr (TensorDock spot) vs L40S at $0.85/hr (FluidStack spot). The RTX A6000 is about 42% cheaper; equivalently, the L40S costs about 73% more.
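
The percentage in a price comparison depends on which price is taken as the baseline. The following is a minimal Python sketch that finds the cheapest listed rate for each GPU and computes the difference both ways. The rates are copied from the pricing tables above; the function names are illustrative only.

```python
# Hourly rates in USD, copied from the pricing tables above.
# Each entry is (on_demand, reserved, spot); None means no rate is listed.
A6000_RATES = {
    "runpod": (1.09, None, 0.79),
    "lambda": (0.99, None, None),
    "vast ai": (0.79, None, 0.55),
    "tensordock": (0.69, None, 0.49),
}
L40S_RATES = {
    "runpod": (1.90, None, 1.49),
    "lambda": (1.59, 1.19, None),
    "coreweave": (1.84, 1.34, None),
    "aws": (2.56, 1.69, None),
    "gcp": (2.45, 1.62, None),
    "vast ai": (1.29, None, 0.95),
    "tensordock": (1.19, None, 0.89),
    "fluidstack": (1.09, None, 0.85),
}


def cheapest_rate(rates):
    """Return (price, provider) for the lowest listed rate of any tier."""
    return min(
        (price, provider)
        for provider, tiers in rates.items()
        for price in tiers
        if price is not None
    )


def percent_cheaper(low, high):
    """How much cheaper `low` is, relative to the higher price."""
    return (high - low) / high * 100


def percent_more_expensive(low, high):
    """How much more `high` costs, relative to the lower price."""
    return (high - low) / low * 100


a6000_price, a6000_provider = cheapest_rate(A6000_RATES)
l40s_price, l40s_provider = cheapest_rate(L40S_RATES)
print(f"RTX A6000: ${a6000_price:.2f}/hr ({a6000_provider})")
print(f"L40S:      ${l40s_price:.2f}/hr ({l40s_provider})")
print(f"RTX A6000 is {percent_cheaper(a6000_price, l40s_price):.0f}% cheaper")
print(f"L40S costs {percent_more_expensive(a6000_price, l40s_price):.0f}% more")
```

Relative to the L40S price the saving is roughly 42%, while relative to the RTX A6000 price the L40S premium is roughly 73%. Note that both cheapest rates are spot prices, which may not suit workloads that cannot tolerate interruption.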

Efficiency Metrics

Metric                       RTX A6000   L40S
TFLOPS / Watt (BF16)         0.1         1.0
VRAM / Dollar (GB/$/hr)      98.0        56.5
Bandwidth / Watt (GB/s/W)    2.6         2.5
Models (FP16, 1 GPU)         162         162
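
The efficiency figures appear to be simple ratios of the specification and pricing values. As a rough sketch, assuming VRAM / Dollar uses the cheapest listed hourly rate ($0.49/hr and $0.85/hr), they can be reproduced in Python as follows. The `Gpu` class and its method names are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    """Spec values copied from the Specifications and Pricing sections."""
    name: str
    vram_gb: float
    bandwidth_gb_s: float
    bf16_tflops: float
    tdp_w: float
    cheapest_usd_hr: float  # assumption: cheapest listed rate of any tier

    def tflops_per_watt(self) -> float:
        return self.bf16_tflops / self.tdp_w

    def vram_per_dollar(self) -> float:
        return self.vram_gb / self.cheapest_usd_hr

    def bandwidth_per_watt(self) -> float:
        return self.bandwidth_gb_s / self.tdp_w


a6000 = Gpu("RTX A6000", 48, 768, 39, 300, 0.49)
l40s = Gpu("L40S", 48, 864, 362, 350, 0.85)

for gpu in (a6000, l40s):
    print(
        f"{gpu.name}: "
        f"{gpu.tflops_per_watt():.1f} TFLOPS/W, "
        f"{gpu.vram_per_dollar():.1f} GB/$/hr, "
        f"{gpu.bandwidth_per_watt():.1f} GB/s/W"
    )
```

Rounded to one decimal place, these ratios match the values listed above. The unrounded BF16 TFLOPS / Watt for the RTX A6000 is 0.13, so the displayed 0.1 understates it slightly.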

Model Compatibility (FP16, Single GPU)

Only on RTX A6000 (0)

None

Both (162)

  • Yi 1.5 9B
  • Yi Coder 9B
  • GTE Qwen2 7B
  • Marco O1
  • Qwen 1.5 MoE A2.7B
  • Qwen 2 Audio 7B
  • Qwen 2.5 14B
  • Qwen 2.5 3B
  • OLMo 2 13B
  • OLMo 2 7B
  • Amazon Nova Lite
  • OpenELM 3B
  • BGE Large EN v1.5
  • BGE M3
  • Baichuan 2 13B
  • OctoCoder 15B
  • StarCoder2 15B
  • StarCoder2 3B
  • StarCoder2 7B
  • Aya 23 8B
  • +142 more

Only on L40S (0)

None
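
Both cards list the same 162 models, which is what one would expect if single-GPU compatibility is driven mainly by VRAM, since both have 48 GB. The page does not state how compatibility is determined, so the following Python sketch is only a common rule of thumb: FP16 weights take about 2 bytes per parameter, plus some headroom for activations and KV cache. The 20% headroom factor and the function names are assumptions made for this illustration.

```python
BYTES_PER_PARAM_FP16 = 2
VRAM_GB = 48            # both the RTX A6000 and the L40S
HEADROOM_FACTOR = 1.2   # assumption: 20% extra for activations and KV cache


def fp16_weights_gb(params_billions: float) -> float:
    """Approximate FP16 weight size in GB (1 GB taken as 1e9 bytes)."""
    return params_billions * BYTES_PER_PARAM_FP16


def fits_fp16(params_billions: float, vram_gb: float = VRAM_GB) -> bool:
    """Rule-of-thumb check: weights plus headroom must fit in VRAM."""
    return fp16_weights_gb(params_billions) * HEADROOM_FACTOR <= vram_gb


# Parameter counts taken from the model names in the list above.
for name, params_b in [("Yi 1.5 9B", 9), ("Qwen 2.5 14B", 14), ("StarCoder2 15B", 15)]:
    print(f"{name}: ~{fp16_weights_gb(params_b):.0f} GB weights, fits={fits_fp16(params_b)}")

# A hypothetical 70B-parameter model (not taken from the list) would not fit.
print(f"70B (hypothetical): fits={fits_fp16(70)}")
```

Under these assumptions, models up to about 20B parameters fit on a single 48 GB card at FP16. Real limits vary with context length, batch size, and the serving framework.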

Summary

The RTX A6000 (Ampere generation) offers 48 GB of GDDR6 with 39 BF16 TFLOPS and 768 GB/s of memory bandwidth at a 300 W TDP.

The L40S (Ada Lovelace generation) offers 48 GB of GDDR6 with 362 BF16 TFLOPS and 864 GB/s of memory bandwidth at a 350 W TDP.

From a cost perspective, the RTX A6000 is more affordable: its cheapest listed rate is $0.49/hr vs $0.85/hr for the L40S.
