Amazon Web Services (AWS) lists 13 GPU and accelerator configurations, with prices starting at $0.76/hour. Compared with the market average of $8.72/hour across all cloud GPU providers, AWS's entry-level price is 91% below average. Because it supports autoscaling, it is well suited to variable inference workloads.
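The "91% below average" figure follows directly from the two prices quoted above. A minimal sketch of the arithmetic, where the function name `percent_below` is a hypothetical helper chosen for illustration:

```python
def percent_below(price: float, average: float) -> float:
    """Return how far `price` sits below `average`, as a percentage."""
    return (average - price) / average * 100


entry_price = 0.76     # $/hour, lowest listed AWS configuration
market_average = 8.72  # $/hour, market average quoted in the summary

print(f"{percent_below(entry_price, market_average):.0f}% below average")
# prints "91% below average"
```

Note that this compares the cheapest AWS entry with an average across all listed configurations, so it says little about how AWS's higher-end GPUs are priced.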
Provider Overview
| Attribute | Value |
|---|---|
| Type | cloud |
| Billing | Per second |
| Egress | Paid |
| SLA Uptime | 99.99% |
| Autoscaling | Yes |
| Cold Start | 60,000 ms (60 s) |
| Storage | $0.08/GB/mo |
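To see what per-second billing, the cold-start time, and the storage rate imply in practice, the overview figures can be turned into a rough estimate. This is only a sketch: the helper names are hypothetical, and it assumes that cold-start time is billed at the same hourly rate as normal runtime, which the listing does not state.

```python
SECONDS_PER_HOUR = 3600


def compute_cost(hourly_rate: float, seconds: float) -> float:
    """Cost of `seconds` of runtime under per-second billing."""
    return hourly_rate / SECONDS_PER_HOUR * seconds


def storage_cost(gigabytes: float, rate_per_gb_month: float = 0.08) -> float:
    """Monthly storage cost at the listed $0.08/GB/mo rate."""
    return gigabytes * rate_per_gb_month


cold_start_seconds = 60000 / 1000  # listed cold start, converted from ms
hourly_rate = 0.76                 # $/hour, cheapest listed configuration

# Assumption: the instance is billed while it is cold-starting.
cold_start_cost = compute_cost(hourly_rate, cold_start_seconds)
print(f"One cold start: ${cold_start_cost:.4f}")
print(f"100 GB stored for a month: ${storage_cost(100):.2f}")
```

With these inputs a single cold start costs just over one cent, so for spiky workloads the 60-second delay is likely to matter more than the charge itself.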
GPU Offerings (13)
| GPU | $/hr | Tier | Availability | Regions |
|---|---|---|---|---|
| aws-inferentia2 | $0.76 | on demand | high | us-east-1, us-west-2 |
| nvidia-l4 | $0.80 | on demand | high | us-east-1, us-west-2 |
| nvidia-a10g | $1.01 | on demand | high | us-east-1, us-west-2, eu-west-1, ap-southeast-1 |
| nvidia-a10g | $1.21 | on demand | high | us-east-1, us-west-2, eu-west-1 |
| nvidia-a10g | $1.62 | on demand | high | us-east-1, us-west-2, eu-west-1 |
| aws-inferentia2 | $1.97 | on demand | high | us-east-1, us-west-2 |
| nvidia-l4 | $4.60 | on demand | high | us-east-1, us-west-2 |
| nvidia-a10g | $5.67 | on demand | high | us-east-1, us-west-2 |
| aws-inferentia2 | $12.98 | on demand | medium | us-east-1, us-west-2 |
| nvidia-a100-80gb-sxm | $19.22 | 1yr reserved | high | us-east-1, us-west-2 |
| nvidia-a100-80gb-sxm | $32.77 | on demand | high | us-east-1, us-west-2, eu-west-1, ap-northeast-1 |
| nvidia-h100-sxm | $62.72 | 1yr reserved | medium | us-east-1, us-west-2 |
| nvidia-h100-sxm | $98.32 | on demand | medium | us-east-1, us-west-2, eu-west-1 |
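The table lists both on-demand and 1yr reserved prices for the A100 and H100, so the reserved discount can be worked out from the listed rates. A minimal sketch, assuming each reserved row describes the same instance size as the on-demand row for that GPU (the table does not confirm this); `reserved_discount` and `listed` are hypothetical names:

```python
def reserved_discount(on_demand: float, reserved: float) -> float:
    """Percentage saved by the reserved rate relative to on demand."""
    return (on_demand - reserved) / on_demand * 100


# (on-demand $/hr, 1yr reserved $/hr), copied from the table above
listed = {
    "nvidia-a100-80gb-sxm": (32.77, 19.22),
    "nvidia-h100-sxm": (98.32, 62.72),
}

discounts = {gpu: reserved_discount(od, rs) for gpu, (od, rs) in listed.items()}
for gpu, pct in discounts.items():
    print(f"{gpu}: {pct:.1f}% cheaper with 1yr reserved")
```

On these figures the reserved tier saves roughly 41% on the A100 and 36% on the H100, though the reserved rows cover fewer regions than their on-demand counterparts.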
Pricing History
nvidia-a10g via aws: $1.52/hr from 2024-01-01 to 2025-03-01 (0.0% change overall)
Reputation Details
| Category | Score |
|---|---|
| Pricing | 50 |
| Reliability | 90 |
| Features | 65 |
Highlights
- 99.9%+ SLA
- Autoscaling supported
Compare with Others
| Provider | Overall | Pricing | Reliability | Features | GPUs |
|---|---|---|---|---|---|
| Amazon Web Services | 67 | 50 | 90 | 65 | 13 |
| RunPod | 70 | 50 | 90 | 75 | 10 |
| Google Cloud Platform | 67 | 50 | 90 | 65 | 10 |
| Microsoft Azure | 67 | 50 | 90 | 65 | 9 |
| Lambda Labs | 62 | 50 | 90 | 50 | 8 |