The NVIDIA Vera Rubin platform fuels the future of AI factory. It delivers breakthrough performance for AI training, AI reasoning, and agentic AI workloads at significantly lower cost per million tokens than NVIDIA Blackwell, enabling large-scale AI deployments.
The NVIDIA Vera Rubin platform marks a performance leap with optimized compute and 288GB HBM4 memory per GPU, with almost 3x HBM4 and 2x NVIDIA NVLink™ 6 bandwidth to elevate GPU-to-GPU communication efficiency with low latency.
NVIDIA Vera Rubin NVL72, unifying 72 NVIDIA Rubin GPUs and 36 NVIDIA Vera CPUs, delivers massive compute density and up to 20.7 TB of HBM4 memory. It leverages NVIDIA NVLink™ 6 switches to interconnect GPUs for scale-up intelligence, integrating the NVIDIA ConnectX®-9 SuperNIC™ and NVIDIA BlueField®-4 DPU for elevated networking capabilities.
NVIDIA Vera Rubin NVL72 is also optimized for scale-out deployments and AI factories. By integrating NVIDIA Quantum-X800 InfiniBand and NVIDIA Spectrum™-X Ethernet, it delivers breakthrough performance for LLM, agentic AI, AI reasoning and video inferencing applications.
Discover the power of QCT’s cutting-edge infrastructures accelerated by NVIDIA Blackwell Ultra GPU, NVIDIA Grace Blackwell Superchip, and the latest NVIDIA data center PCIe GPUs. These QuantaGrid systems not only support future accelerators and liquid cooling to meet diverse workload needs, but also feature flexible, modular designs to speed up development and reduce time–to–market.
NVIDIA Vera Rubin NVL72, unifying 72 NVIDIA Rubin GPUs and 36 NVIDIA Vera CPUs, delivers massive compute density and up to 20.7 TB of HBM4 memory. It leverages NVIDIA NVLink™6 switches to interconnect GPUs for scale-up intelligence, integrating the NVIDIA ConnectX®-9 SuperNIC™ and NVIDIA BlueField®-4 DPU for elevated networking capabilities.
| Feature | Performance | Compared with Blackwell |
|---|---|---|
| NVFP4 Inference | 3.6 EFLOPS | 5x |
| NVFP4 Training | 2.5 EFLOPS | 3.5x |
| LPDDR5X Capacity | 54TB | 2.5x |
| HBM4 Capacity | 20.7TB | 1.5x |
| HBM4 Bandwidth | 1.6PB/s | 2.8x | Scale-up Bandwidth | 260 TB/s | 2x |
| Platform | NVIDIA Vera Rubin | CPU GPU |
(2) NVIDIA Vera CPUs (4) NVIDIA Rubin GPUs |
Memory |
CPU: Up to 768GB LPDDR5X per CPU GPU: Up to 288GB HBM4 per GPU |
Storage | (4) E1.S 9.5mm PCIe data SSDs (1) E1.S 9.5mm PCIe boot SSD |
Networking |
(1) NVIDIA® BlueField®-4 B4240V with dual 400Gb/s QSFP112 ports (8) NVIDIA ConnectX®-9 800G OSFP ports |
Power | 48-54V DC busbar clip | Dimensions | (W) 438 x (H) 43.6 x (D) 766mm |
The platform introduces a specialized G3.5 context memory tier between node-local SSDs and shared storage, extending effective GPU KV-cache capacity across the pod. By treating KV cache as a first-class AI data type and optimizing pod-level context management, it delivers up to 5x higher tokens-per-second and up to 5x better power efficiency for agentic AI workloads.
This AI-native architecture uses the NVIDIA® B4480SP Dual BF4 Storage Processor, NVIDIA DOCA™ Memos, NVIDIA, Dynamo, and NVIDIA Spectrum-X™ Ethernet to enable seamless, low latency context sharing across nodes. By rapidly pre-staging KV-cache context back to the GPU during prefill, it minimizes inference stalls and maximizes end-to-end throughput. This ensures that large-scale AI factories remain both high performing and power- and cost-effective.
| HPM |
B4480SP Dual Bluefield-4 STX Storage Processors, powered by NVIDIA Vera: (2) NVIDIA Vera CPU, 450W each (4) NVIDIA ConnectX®-9 | Memory | (16) SOCAMM DIMM slot, populate (4) SOCAMM for CMX | Networking | (4) NVIDIA ConnectX®-9 800G OSFP ports | Storage |
(24) E3.S NVMe data drives (2) M.2 2280 boot drive |
Cooling | Full liquid cooling | Power | 48-54V DC busbar | Form Factor | 2U | Dimensions | (W) 438 x (H) 87.5 x (D) 766mm |
QuantaEdge EGN77C-2U is a carrier-grade server system derived from NVIDIA Aerial RAN Computer Pro (ARC-Pro), an AI-RAN platform for software-defined, AI-native 5G and 6G networks. Built on the NVIDIA Grace Blackwell architecture, it integrates NVIDIA RTX PRO™ 4500 Blackwell Server Edition GPU, Grace CPUs, and embedded NVIDIA ConnectX®-8 networking to support time-sync and RAN Layer 1–3 workloads. Optimized for serviceability, reliability, and telco deployments, EGN77C-2U enables operators to evolve from traditional RAN to a scalable, software-defined AI-RAN architecture.
| Processor | (1) NVIDIA Grace™ CPU per node | Memory | (16) LPDDR5X chip-down per node | Networking | (16) 25GbE SFP28 + (2) 400GbE QSFP112 per node | Onboard Storage | (2) PCIe 5.0 M.2 2280 per node | Expansion Slots | (1) PCIe 5.0 x16 FHFL single-width slot per node | Dimensions | (W) 448 x (H) 87.5 x (D) 420mm | Form Factor | 2U2N EIA rackmount server |
The NVIDIA GB300 NVL72 brings enhanced compute and memory capabilities to the next generation of AI and accelerated computing with 72 interconnected NVIDIA Blackwell Ultra GPUs acting as one gigantic GPU.
This rack-level solution can be seamlessly integrated with existing NVIDIA GB200 NVL72 infrastructure, delivering higher performance with efficient liquid cooling. The design reduces energy use while maximizing compute density and optimizing floor space.
| Platform | (2) NVIDIA GB300 Grace Blackwell Ultra Superchip | Processor | (2) NVIDIA Grace™ CPU | GPU | (4) NVIDIA Blackwell Ultra GPUs | Memory |
CPU: Up to 480GB LPDDR5X per CPU GPU: Up to 279GB HBM3e per GPU |
Storage | (4) E1.S 15mm PCIe SSDs, (8) slots available | Onboard Storage | (1) PCIe M.2 22110/2280 SSDs | Networking | (1) NVIDIA® BlueField®-3 B3240 dual port 400G DPU (4) NVIDIA® ConnectX®-8 800Gb OSFP ports |
Power | 48-54V DC bus bar | Dimensions | (W) 438 x (H) 43.6 x (D) 766mm |
The NVIDIA GB200 NVL72 is a powerhouse that connects 36 NVIDIA Grace™ CPUs and 72 Blackwell GPUs via the NVIDIA NVLink™-C2C interconnect. Functioning as a single, colossal GPU, this liquid-cooled, rack-scale system is designed to navigate the complexities of trillion-parameter AI models with unprecedented ease.
| Platform | (2) NVIDIA GB200 Grace Blackwell Superchip | Processor | (2) NVIDIA Grace™ CPUs | GPU | (4) NVIDIA Blackwell GPUs | Memory | CPU: Up to 480GB LPDDR5x per CPU
GPU: Up to 186GB HBM3e per GPU |
Storage | (8) hot-swappable E1.S 15mm PCIe SSDs | Onboard Storage | (1) PCIe M.2 22110/2280 SSDs | Networking | (2) NVIDIA® BlueField®-3 B3240 dual port 400G DPUs (4) NVIDIA ConnectX®-7 400Gb OSFP ports |
Cooling | CPU/GPU: Liquid cooling cold plate Peripheral: (8) 4056 dual rotor fans |
Power | 48-54V DC bus bar clip | Dimensions | (W) 438 x (H) 43.6 x (D) 766mm |
The QuantaGrid D75E-4U is more than just an x86-based system built on the Intel® Xeon® platform. It adheres to the NVIDIA MGX™ architecture, offering a modular design that meets diverse AI applications and customer demands. This system is compatible with a full range of NVIDIA data center PCIe GPUs, including NVIDIA RTX PRO™ 6000 Blackwell Server Edition, NVIDIA H200 NVL, NVIDIA H100 NVL, NVIDIA L40S GPU, NVIDIA L4 GPU, NVIDIA A10 GPU, and NVIDIA A16 GPU, enabling unparalleled flexibility and performance. The NVIDIA H200 NVL is particularly suited for organizations with data centers seeking low-power, air-cooled enterprise rack designs. It delivers versatile acceleration for AI and HPC workloads of all sizes, making it an ideal choice for enterprises prioritizing efficiency and scalability. With the QuantaGrid D75E-4U, customers can maximize computing power in compact spaces. The system supports flexible GPU configurations—1, 2, 4, or 8 GPUs—allowing companies to optimize their existing rack infrastructure and tailor performance to their specific requirements.
| Processor | (2) Intel® Xeon® 6 processors, up to 350W TDP |
|---|---|
| Memory | (32) DDR5 RDIMM up to 6,400 MHz |
| Networking | (1) Dedicated 1GbE management port |
| Storage |
SKU1 - (4) DW GPUs
(12) hot-swappable E1.S SSDs
SKU2 - (8) DW GPUs
(24) hot-swappable E1.S SSDs
|
| Expansion Slot |
SKU1 - (4) DW GPUs
SKU2 - (8) DW GPUs
|
| GPU Expansion |
SKU1 - (4) DW GPUs
(4) NVIDIA RTX PRO™ 6000, NVIDIA H200, NVIDIA L40S
SKU2 - (8) DW GPUs
(8) NVIDIA RTX PRO™ 6000, NVIDIA H200, NVIDIA L40S
|
| Cooling | Air cooling (design reserved for liquid cooling) |
| Power | (3+1) 2700W/3200W CRPS Titanium PSUs |
| Form Factor | 4U Rackmount Server |
| Dimensions | (W) 438 x (H) 176 x (D) 800mm |
QCT systems based on the NVIDIA MGX™ architecture, such as the QuantaGrid S74G-2U, QuantaEdge EGX77GE-2U and new NVIDIA Grace™ Blackwell servers, allow different configurations of GPUs, CPUs and DPUs, shortening the time frame for building future compatible solutions. Based on the modular reference design, these configurations can not only support future accelerators, but also meet the requirements of diverse workloads, including those that incorporate liquid cooling, to shorten the development journey and reduce time to market.
| Processor | NVIDIA GH200 Grace Hopper™ Superchip, 1000W TDP | Memory | CPU: Up to 480GB LPDDRX embedded GPU: 144GB HBM3E memory Coherent memory between CPU and GPU with NVIDIA NVLink™-C2C interconnect with a speed of 900GB/s |
Storage | (4) hot-swappable E1.S NVMe SSDs | Networking | (1) Dedicated 1GbE management port | Expansion Slot | (3) FHFL DW PCIe 5.0 x16 slots | Dimensions | (W) 438 x (H) 87.5 x (D) 900mm |
| Processor | NVIDIA GH200 Grace Hopper™ Superchip, 1000W TDP | Storage | Internal Storage: (2) SATA/NVMe M.2 22110/2280 SSDs External Storage: (2) E1.S SSDs |
Expansion Slot | (3) FHFL PCIe 5.0 x16 slots | Dimensions | (W) 447.8 x (H) 86.8 x (D) 400mm |
| Processor | (2) Intel® Xeon® 6 processors, up to 350W TDP
| Memory | (32) DDR5 RDIMM up to 6,400 MTs, (16) MRDIMM up to 8,000 MTs | Networking |
(8) 800Gb/s OSFP ports via NVIDIA ConnectX®-9 SuperNIC™ cards (1) Dedicated LAN port (RJ45) for BMC management (1) 1GbE LAN port (RJ45) |
Accelerator | (8) NVIDIA Rubin SXM8 GPUs | Storage |
(8) hot-swappable E1.S SSDs (2) PCIe M.2 2280 SSDs |
Expansion Slots | (1) FHFL SW PCIe 6.0* x16 slot for NVIDIA® BlueField®-4 | Cooling | Fully liquid cooled | Power | 48-54V DC busbar clip | Dimensions | (W) 448 x (H) 87 x (D) 800mm |
Note: *Motherboard support PCIe gen5, with other PCBA support PCIe gen6.
| Processor | (2) AMD EPYC™ 9006 Server CPUs | Memory | Supports industry-standard DDR5 RDIMM and MRDIMMs based memory | Networking |
(8) OSFP ports serving (8) single-port NVIDIA ConnectX®-9 (1) Dedicated LAN port (RJ45) for BMC management (1) 1GbE LAN port (RJ45) |
Storage |
(8) Hot-swappable PCIe 6.0 E1.S drives (2) PCIe M.2 2280 drives |
Expansion Slot | (1) FHFL SW PCIe 6.0 x16 slot for NVIDIA® BlueField®-4 | Cooling | Full liquid cooling | Power | 48-54V DC busbar clip | Form Factor | 2U rackmount server | Dimensions | (W) 448 x (H) 87 x (D) 800mm |
| Processor | (2) Intel® Xeon® 6 processors, up to 350W | Networking |
(1) Dedicated LAN port (RJ45) for BMC management (1) 1GbE LAN port (RJ45) (8) OSFP ports serving (8) single-port NVIDIA ConnectX-8 SuperNICs™ |
Accelerator | (8) NVIDIA Blackwell Ultra GPUs | Storage | (8) hot-swappable E1.S SSDs (2) PCIe M.2 2280 SSDs |
Expansion Slot | (4) FHHL SW or (2) FHHL DW PCIe 5.0 x16 slots |
Dimensions | (W) 447 x (H) 441.75 x (D) 800mm |
| Processor |
(2) Intel® Xeon® 6 processors, up to 350W | Networking |
(1) Dedicated LAN port (RJ45) for BMC management (1) 1GbE LAN port (RJ45) (8) OSFP ports serving (8) single-port NVIDIA ConnectX®-8 VPI |
Accelerator | (8) NVIDIA B300 SXM6 GPUs | Storage |
(8) hot-swappable E1.S SSDs (2) PCIe M.2 22110 SSDs |
Expansion Slot | (3) FHHL SW PCIe 5.0 x16 slots | Dimensions | (W) 438 x (H) 131.7 x (D) 864.5 mm |
| Processor | (2) AMD EPYC™ 9004/9005 Series processors, up to 500W (8) NVIDIA H200 SXM 8-GPU, 700W TDP | Networking | (1) Dedicated 1GbE management port | Storage | (18) hot-swappable 2.5" NVMe SSDs | Expansion Slot | (2) FHHL DW PCIe 5.0 x16 slots (8) HHHL SW PCIe 5.0 x16 slots |
Dimensions | (W) 447.8 x (H) 219.5 x (D) 950mm |
| Processor | (2) 5th/4th Gen Intel® Xeon® Scalable processors, up to 350W (1) NVIDIA HGX™ H200 8-GPU baseboard, 700W TDP | Networking | (1) Dedicated 1GbE management port | Storage | Front (18) hot-swap 2.5" PCIe 5.0 NVMe SSDs | Expansion Slot | (1) FHHL SW PCIe 5.0 x16 slot (1) OCP NIC 3.0 SFF PCIe 5.0 x16 slot (10) OCP NIC 3.0 TSFF PCIe 5.0 x16 slots |
Dimensions | (W) 447.8 x (H) 307.85 x (D) 886mm |
| Processor |
(2) 5th/4th Gen Intel® Xeon® Gen Scalable processors, up to 350W TDP | Networking | (1) Dedicated 1GbE management port | Accelerator | (8) NVIDIA H200 SXM5 GPUs | Storage | (18) hot-swappable 2.5" NVMe SSDs | Expansion Slot |
[D75F-7U]: (2) FHHL DW PCIe 5.0 x16 slots + (8) HHHL SW PCIe 5.0 x16 slots [D74F-7U]: (1) FHHL SW PCIe 5.0 x16 slot + (1) OCP 3.0 SFF slot + (10) OCP NIC 3.0 TSFF slots [D74H-7U]: (2) OCP 3.0 SFF slots + (10) OCP NIC 3.0 TSFF slots |
Dimensions |
[D75F-7U]: (W) 447.8 x (H) 307.85 x (D) 950mm [D74F-7U/D74H-7U]: (W) 447.8 x (H) 307.85 x (D) 886mm |
QCT has adopted a rich portfolio of NVIDIA cutting edge GPUs to accelerate some of the world’s most demanding workloads including Al, HPC and data analytics, pushing the boundaries of innovation from cloud to edge.
| Processor | (2) 5th/4th Gen Intel® Xeon® Scalable processors, up to 350W TDP | Networking | (1) Dedicated 1GbE management port | Accelerator | NVIDIA H200 NVL GPU, NVIDIA RTX PRO™ 6000 Blackwell Server Edition GPU, NVIDIA H100 GPU, NVIDIA L40S GPU, NVIDIA L40 GPU, NVIDIA A16 GPU, NVIDIA A2 GPU | Storage | (10) hot-swappable 2.5" SATA/SAS/NVMe SSDs | Expansion Slot |
(1) OCP 3.0 PCIe 5.0 x16 slot (2) HHHL PCIe 5.0 x16 slots (1) HHHL PCIe 5.0 x8 slots |
Dimensions | (W) 438 x (H) 131.6 x (D) 760mm |
| Processor | (2) Intel® Xeon® 6 processors, up to 330W/350W | Networking | (1) Dedicated 1GbE management port | Accelerator | NVIDIA H100 GPU, NVIDIA L40S GPU, NVIDIA L4 GPU | Storage |
SKU -#1 (12) hot-swappable 3.5" SATA/SAS or (12) hot-swappable 2.5 NVMe SSDs SKU -#2 (24) hot-swappable 2.5" NVMe/ SATA/ SAS SSDs |
Expansion Slot |
Option1 (4) FHHL PCIe 5.0 x8 slots+ (2) HHHL PCIe 5.0 x16 slots + (1)HHHL PCIe 5.0 x8 slots + (2) OCP 3.0 slots Option2 (4) FHHL PCIe 5.0 x8 slots + (2) HHHL PCIe 5.0 x16 slots + (1) HHHL PCIe 5.0 x8 slots + (2) OCP 3.0 slots [supports SW GPU] Option3 (3) FHFL PCIe 5.0 x16 slots + (2) HHHL PCIe 5.0 x16 slots + (1)HHHL PCIe 5.0 x8 slots + (2) OCP 3.0 slots [supports DW GPU] |
Dimensions | (W) 440 x (H) 87.5 x (D) 780mm |
| Processor | (2) Intel® Xeon® 6 processors, up to 330W/350W | Networking | (1) Dedicated 1GbE management port | Accelerator | NVIDIA L4 GPU | Storage |
SKU - #1 (12) hot-swappable 2.5" SSDs SKU - #2 (16) hot-swappable E1.S SSDs SKU - #3 (20) hot-swappable E3.S SSDs |
Expansion Slot |
Option 1 (3) HHHL PCIe 5.0 x16 slots + (2) OCP 3.0 slots Option 2 (2) FHHL PCIe 5.0 x16 slots + (2) OCP 3.0 slots |
Dimensions | (W) 440 x (H) 43.2 x (D) 780mm |
| Processor | (1) Intel® Xeon® 6 processor, up to 350W TDP | Networking | (1) Dedicated 1GbE management port | Accelerator | NVIDIA L4 GPU | Storage | (12) hot-swappable 2.5“ NVMe/SATA/SAS SSDs | Expansion Slot |
Option 1 (2) HHHL PCIe 5.0 x16 slots + (1) OCP 3.0 slot Option 2 (2) FHHL PCIe 5.0 x16 slots + (1) OCP 3.0 slot |
Dimensions | (W) 440 x (H) 43.2 x (D) 780mm |
| Processor |
[S44NL-1U]: (1) AMD EPYC™ 9004/9005 Series processor, up to 500W TDP [D44N-1U]: (2) AMD EPYC™ 9004/9005 Series processors, up to 500W TDP | Networking | (1) Dedicated 1GbE management port | Accelerator | NVIDIA L4 GPU | Storage |
SKU - #1 (12) hot-swappable 2.5″ NVMe/SATA/SAS SSDs SKU - #2 (16) hot-swappable E1.S SSDs |
Expansion Slot |
Option 1 (3) HHHL PCIe 5.0 x16 slots Option 2 (2) FHHL PCIe 5.0 x16 slots + (2) OCP 3.0 slots |
Dimensions | (W) 440x (H) 43.2 x (D) 780mm |
| Processor | (1) Intel® Xeon® 6 processor | Networking | (1) OCP 3.0 SFF slot | Accelerator | NVIDIA L40S GPU, NVIDIA L4 GPU, NVIDIA A2 GPU | Storage | (3) hot-swappable 2.5" NVMe SSDs | Expansion Slot |
Option 1 (1) FHFL SW PCIe 5.0 x16 slot (1) FHFL SW PCIe 5.0 x8 slot (1) FHHL SW PCIe 5.0 x16 slot Option 2 (1) FHFL DW PCIe 5.0 x16 slot (1) FHHL SW PCIe 5.0 x16 slot |
Dimensions | (W) 447.8 x (H) 86.3 x (D) 875mm |
| Processor | (1) Intel® Xeon® 6 processor | Networking | (1) OCP 3.0 SFF slot | Accelerator | NVIDIA L4 GPU | Storage | (2) hot- swappable E1.S SSDs | Expansion Slot | (1) HHHL PCIe Gen5 x16 | Dimensions | (W) 447.8 x (H) 86.3 x (D) 875mm |
| Processor | (1) Intel® Xeon® 6 processor, up to TDP 325W | Networking |
SKU - #1 (16) 25GbE SFP28 (LoM) SKU - #2 (24) 25GbE SFP28 (LoM & Intel Carter Flat) |
Accelerator | NVIDIA L4 GPU | Storage | (2) SATA/NVMe M.2 22110/2280 SSDs | Expansion Slot |
SKU - #1 (1) FHHL PCIe 5.0 slot SKU - #2 (1) OCP 3.0 PCIe 5.0 slot for Intel Carter Flat |
Dimensions | (W) 447.8 x (H) 42.8 x (D) 300.65mm |
| Processor | (1) 5th/4th Gen Intel® Xeon® Scalable processor, up to 250W TDP | Networking |
SKU - #1 (8) 25GbE w/ Sync-E, NCSI SKU - #2 (4) 25GbE and (8) 10GbE w/ Sync-E, NCSI |
Accelerator | NVIDIA L4 GPU | Storage | (2) SATA/NVMe M.2 2280 SSDs | Expansion Slot | (1) FHHL PCIe 5.0 x16 slot | Dimensions | (W) 447.8 x (H) 42.8 x (D) 300.65mm (ear to rear wall) |
| Processor | (1) 4th Gen Intel® Xeon® Scalable processor, up to 250W TDP | Networking |
(4) 25GbE SFP28 ports (NCSI) (1) 1GbE RJ45 management port |
Accelerator | NVIDIA L4 GPU | Storage |
SKU - #1 (2) SATA/NVMe M.2 2280 SSDs SKU - #2 (2) SATA/NVMe M.2 2280 SSDs (2) 2.5" U.2 SSDs |
Expansion Slot |
SKU - #1 (2) FH3/4L PCIe 5.0 x16 slots (1) FHHL PCIe 5.0 x16 slot SKU - #2 (2) FH3/4L PCIe 5.0 x16 slots |
Dimensions | (W) 447.8 x (H) 42.8 x (D) 400mm |
QCT Cutting-Edge Infrastructures
Accelerated by NVIDIA
Unleashing the Power of NVIDIA HGX™
for AI and HPC
QCT NVIDIA MGX™ Systems
Official Overview