Real-time NVIDIA B200 cloud pricing from 10 providers. Cheapest on-demand: $3.87/hr (Vultr). Updated daily by GridStackHub.
Sorted by cheapest per-GPU hourly rate. Includes on-demand, spot, and reserved pricing where available.
| Provider | Price/hr | Type | Region | VRAM | Updated |
|---|---|---|---|---|---|
|
VultrLowest
Cloud GPU B200
|
$3.87/hr | On-Demand | US | 192 GB | Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
RunPod
NVIDIA B200
|
$5.98/hr | On-Demand | us-east-1 | 180 GB | Wed Jun 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
RunPod
B200 192GB
|
$5.98/hr | On-Demand | US/EU | 192 GB | Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Vast.ai
B200 192GB (marketplace)
|
$6.40/hr | Spot | Various | 192 GB | Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Vast.ai
vast-b200
|
$6.40/hr | Spot | global | 179 GB | Mon Jun 01 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Google Cloud
a4-highgpu-8g (8x B200)
|
$6.60/hr $52.80/hr for 8× node |
On-Demand | us-central1 | 192 GB | Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_8x_b200_sxm6 (8x B200)
|
$6.69/hr $53.52/hr for 8× node |
On-Demand | US | 192 GB | Tue May 19 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_8x_b200_sxm6
|
$6.69/hr $53.52/hr for 8× node |
On-Demand | us-east-1 | 192 GB | Wed Jun 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda
HGX B200 SXM6 8x GPU
|
$6.69/hr $53.52/hr for 8× node |
On-Demand | US | 192 GB | Tue May 19 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_4x_b200_sxm6
|
$6.79/hr $27.16/hr for 4× node |
On-Demand | us-east-1 | 192 GB | Wed Jun 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_4x_b200_sxm6 (4x B200)
|
$6.79/hr $27.16/hr for 4× node |
On-Demand | US | 192 GB | Tue May 19 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_2x_b200_sxm6
|
$6.89/hr $13.78/hr for 2× node |
On-Demand | us-east-1 | 192 GB | Wed Jun 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_2x_b200_sxm6 (2x B200)
|
$6.89/hr $13.78/hr for 2× node |
On-Demand | US | 192 GB | Tue May 19 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
AWS
p6.48xlarge (8x B200)
|
$6.90/hr $55.20/hr for 8× node |
On-Demand | us-east-1 | 192 GB | Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Corvex
B200 SXM 192GB
|
$6.99/hr | On-Demand | US | 192 GB | Tue Jun 09 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_1x_b200_sxm6
|
$6.99/hr | On-Demand | us-east-1 | 192 GB | Wed Jun 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Lambda Labs
gpu_1x_b200_sxm6 (1x B200)
|
$6.99/hr | On-Demand | US | 192 GB | Tue May 19 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
Azure
ND B200 v6 (8x B200)
|
$7.05/hr $56.40/hr for 8× node |
On-Demand | East US | 192 GB | Fri Jun 05 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
CoreWeave
HGX_B200_x1 (1x B200)
|
$8.60/hr | On-Demand | US | 192 GB | Tue May 19 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
CoreWeave
HGX_B200_x8
|
$8.60/hr $68.80/hr for 8× node |
On-Demand | US | 192 GB | Wed Jun 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
|
CoreWeave
HGX B200 8x GPU
|
$8.60/hr $68.80/hr for 8× node |
On-Demand | US | 192 GB | Tue May 19 2026 00:00:00 GMT+0000 (Coordinated Universal Time) |
Key hardware specifications for the NVIDIA B200.
| Architecture | Blackwell (SXM) |
| VRAM | 192GB HBM3e |
| Memory Bandwidth | 8.0 TB/s |
| FP8 Throughput | 9,000 TFLOPS |
| FP16 Throughput | 4,500 TFLOPS |
| GPU-to-GPU Bandwidth | 1.8 TB/s (NVLink 5) |
| TDP | 1,000W |
| Gen | 5th Gen NVLink |
The NVIDIA B200 is the current flagship GPU, built on NVIDIA's Blackwell architecture. With 192GB HBM3e memory and 9,000 TFLOPS FP8 compute, it delivers 2.27× the throughput of an H200 and 2.27× that of an H100 SXM5 in memory-bandwidth-bound workloads.
B200 supply remains constrained in April 2026. Lambda ($5.29/hr) and CoreWeave ($5.49/hr) are the most accessible on-demand providers. Hyperscaler pricing — AWS at $6.90/GPU, Google Cloud at $6.60/GPU, Azure at $7.05/GPU — reflects normalized per-GPU rates from 8-GPU nodes.
The 192GB HBM3e enables running frontier-size models on a single GPU. For inference on 70B–405B parameter models, B200 on-demand eliminates multi-GPU tensor parallelism in many configurations, simplifying deployment and reducing interconnect overhead.
NVLink 5 interconnect on B200 nodes provides 1.8 TB/s GPU-to-GPU bandwidth — double H100 SXM5 NVSwitch bandwidth. For dense training on 1T+ parameter models, the B200's bandwidth advantage compounds across nodes. CoreWeave and Lambda have the deepest current B200 inventory.
Use GridStackHub's GPU cost calculator to get a ranked comparison with hidden-cost breakdown (egress + storage) across all providers.
📊 Open Calculator View All GPU Pricing