NVIDIA Networking for AI and HPC

Demanding AI and HPC workloads require networking infrastructure that provides rapid data transfer between storage systems and GPU-accelerated servers. This is achieved by connecting these devices with specialised high-throughput, low-latency switches and network interface cards.

This type of networking hardware is manufactured by NVIDIA (following its acquisition of Mellanox in 2020), complementing DGX, HGX, MGX and EGX server platforms. Learn more in our AI Training Buyers Guide.

NVIDIA networking switch

This guide covers NVIDIA Quantum and Spectrum network switches plus NVIDIA ConnectX Smart NICs, Super NICs and BlueField DPUs for AI and HPC workloads. For traditional office and enterprise networking, read our Network Switches Buyers Guide and Network Cards Buyers Guide.

NVIDIA ConnectX network card

Network Topology

Datacentres designed for AI and HPC workloads employ a specific type of network topology known as spine-leaf architecture. This is a two-tier architecture in which leaf switches connect to clusters of GPU-accelerated servers and spine switches connect to every leaf switch. The spine layer is built with three or more switches, but there are no connections between the spines; instead, the two switch tiers are joined by a routing fabric.

For larger networks, leaf switches can be added incrementally and aggregated with the spine layer to create a POD. To scale the datacentre out further, multiple PODs are connected via another layer of switches - called super-spines. These two architectures are illustrated below.

Two-tier network architecture

Three-tier network architecture

These spine-leaf and super-spine-leaf network architectures offer higher performance and scalability than the traditional datacentre network architecture of access, aggregation and core layers, as highlighted in the table below.

Feature | Traditional Architecture | Spine-Leaf Architecture
Layers | Three tiers: access, aggregation and core | Two tiers: spine and leaf, or three tiers: super spine, spine and leaf
Interconnection | Hierarchical connections: access connects to aggregation; aggregation connects to core | Every leaf switch connects to every spine switch
Traffic Flow | Primarily designed for north-south traffic (end-user to datacentre) | Optimised for east-west traffic (server-to-server) with lower, predictable latency
Scalability | Less scalable; requires significant re-architecting for growth | Highly scalable; add more switches to either layer to scale out without re-architecting
Path Utilisation | Relies on Spanning Tree Protocol (STP) to block redundant links, leaving paths unused and prone to oversubscription | Uses Equal Cost Multi-Path (ECMP) routing to load balance traffic across all available paths simultaneously
Cabling | Fewer cables, but potentially less efficient use of bandwidth | Requires more cabling overall
Performance | Higher latency and potential bottlenecks at the aggregation layer | Lower latency and higher throughput due to fewer hops
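To put rough numbers on how a spine-leaf fabric grows, the short Python sketch below estimates how many leaf and spine switches are needed for a given number of server links. The 64-port leaf size, the 1:1 (non-blocking) oversubscription target and the 512-server example are illustrative assumptions rather than figures for any particular NVIDIA switch model.

```python
# Rough sizing sketch for a non-blocking two-tier spine-leaf fabric.
# Port counts and the oversubscription target are illustrative assumptions,
# not specifications of any particular switch model.

def size_fabric(servers: int, leaf_ports: int = 64, oversubscription: float = 1.0):
    """Estimate leaf and spine counts for a given number of server links."""
    # Split each leaf's ports between server downlinks and spine uplinks.
    uplinks_per_leaf = int(leaf_ports / (1 + oversubscription))
    downlinks_per_leaf = leaf_ports - uplinks_per_leaf

    leaves = -(-servers // downlinks_per_leaf)   # ceiling division
    # In the simplest layout each leaf has one uplink per spine,
    # so the spine count equals the uplinks available on a leaf.
    spines = uplinks_per_leaf
    leaf_spine_links = leaves * spines           # full mesh between the two tiers
    return leaves, spines, leaf_spine_links

if __name__ == "__main__":
    leaves, spines, links = size_fabric(servers=512)
    print(f"{leaves} leaf switches, {spines} spine switches, {links} inter-switch links")
```

In this simple model, doubling the server count simply doubles the leaf count and cabling without touching the spine design, which is the scale-out property highlighted in the table above.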

Spine-leaf architectures support both InfiniBand and Ethernet. NVIDIA separates these protocols into two switch families: Quantum for InfiniBand and Spectrum for Ethernet.

Network Protocols

Traditionally, InfiniBand was the choice of HPC and AI users, since it offered better throughput and lower latencies by offloading several functions onto the NIC rather than having them carried out by the host CPU. In recent years, the Ethernet protocol has mirrored these offloading capabilities, so the two technologies now offer similar performance.

NVIDIA Quantum for InfiniBand

It is fair to say that InfiniBand is still the first choice for the most demanding AI workloads. It was designed from the ground up to support ultra-low latency and high throughput through technologies such as remote direct memory access (RDMA), which allows high-speed access to the memory of connected systems, bypassing the CPU(s) of either. InfiniBand is also lossless by design, which means delays due to data packet loss are avoided, delivering predictable and reliable performance without requiring extensive tuning through quality-of-service (QoS) processes. That said, the specialist nature of the hardware can make it expensive, and dedicated InfiniBand knowledge is required to support it.

Version | Transfer Rate (Gigabits per Second) | Transfer Rate (Gigabytes per Second)
FDR | 54 | 6.75
EDR | 100 | 12.5
HDR | 200 | 25
NDR | 400 | 50
XDR | 800 | 100

NVIDIA Spectrum for Ethernet

Ethernet is the older, more traditional networking protocol, offering cost-effective but historically slower connectivity. Recent advancements such as RDMA over Converged Ethernet (RoCE) provide the same high-speed memory access without CPU overhead that InfiniBand has always enjoyed, levelling the playing field to a large extent. However, these capabilities are built by adapting and evolving a long-standing protocol rather than being innate to its design, which is why InfiniBand is still promoted as the protocol of choice - especially in the most demanding scenarios. That said, Ethernet is a well-understood protocol and requires little specialist knowledge to support.

Version | Transfer Rate (Gigabits per Second) | Transfer Rate (Gigabytes per Second)
50GbE | 50 | 6.25
100GbE | 100 | 12.5
200GbE | 200 | 25
400GbE | 400 | 50
800GbE | 800 | 100

It is worth pointing out that NVIDIA Quantum and Spectrum switches and ConnectX NICs are backwards compatible with slower InfiniBand and Ethernet speeds. However, most AI and HPC workloads today require at least 100Gb/s, and applications such as LLMs, generative AI, agentic AI and physical AI demand ever-increasing throughput due to the vast datasets and time required to train them.
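For a feel of why these line rates matter, the snippet below converts link speed into an idealised transfer time for a training dataset. The 10TB dataset size is an arbitrary example, and real-world throughput will be lower than line rate once protocol overheads and storage performance are taken into account.

```python
# Idealised time to move a dataset at various InfiniBand/Ethernet line rates.
# Ignores protocol overheads, congestion and storage limits.

def transfer_time_seconds(dataset_tb: float, link_gbps: float) -> float:
    dataset_bits = dataset_tb * 1e12 * 8          # terabytes -> bits
    return dataset_bits / (link_gbps * 1e9)       # bits / (bits per second)

for speed in (100, 200, 400, 800):                # EDR/HDR/NDR/XDR line rates in Gb/s
    minutes = transfer_time_seconds(10, speed) / 60
    print(f"{speed} Gb/s link: ~{minutes:.1f} minutes to move a 10 TB dataset")
```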

Port Types

Both NVIDIA Quantum and Spectrum network switches use small form-factor pluggable (SFP) ports, which employ fibre optics to allow for the fastest data transmissions over the greatest distances. However, as speeds increase through the various generations of InfiniBand and Ethernet, enhanced versions have been designed, including the four-channel QSFP (quad small form-factor pluggable), the QSFP-DD (double density) and the eight-channel OSFP (octal small form-factor pluggable). The latest NVIDIA switches remove the need for pluggable transceivers altogether, integrating the photonics onto the switch die and employing MPO (multi-fibre push-on) connectors. The table below shows speeds and compatibilities.

NVIDIA switch with QSFP ports
Speed SFP56 QSFP+ QSFP28 QSFP56 QSFP56-DD QSFP112 OSFP MPO
50Gb/s
100Gb/s
200Gb/s
400Gb/s
800Gb/s

These port types are also mirrored on NVIDIA ConnectX Smart NICs, Super NICs and BlueField DPUs, as their throughput capabilities increase in line with the switches.
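The aggregate speed of each module type is simply its channel (lane) count multiplied by the per-lane rate, as the minimal sketch below illustrates. The per-lane rates shown are representative generations only, not a full compatibility matrix for the table above.

```python
# Module speed = number of lanes x per-lane rate. Channel counts follow the
# form-factor names (SFP = 1, QSFP = 4, OSFP = 8); per-lane rates shown are
# representative examples, not an exhaustive compatibility list.

LANES = {"SFP": 1, "QSFP": 4, "OSFP": 8}

def module_speed_gbps(form_factor: str, lane_rate_gbps: int) -> int:
    return LANES[form_factor] * lane_rate_gbps

print(module_speed_gbps("SFP", 50))     # SFP56:   1 x 50G  = 50Gb/s
print(module_speed_gbps("QSFP", 50))    # QSFP56:  4 x 50G  = 200Gb/s
print(module_speed_gbps("QSFP", 100))   # QSFP112: 4 x 100G = 400Gb/s
print(module_speed_gbps("OSFP", 100))   # OSFP:    8 x 100G = 800Gb/s
```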

Switch and NIC Specifications

NVIDIA Quantum InfiniBand switches are available in various generations, defined by increasing performance and additional features. They are complemented by NVIDIA ConnectX Smart NICs, Super NICs and BlueField DPUs. Click the tabs below to explore the range.

Quantum X800 Switches

NVIDIA Quantum X800 switches are purpose-built for trillion-parameter-scale agentic and physical AI models, delivering 800Gb/s of end-to-end connectivity with ultra-low latency. They feature in-network compute acceleration technologies such as NVIDIA SHARP v4 (Scalable Hierarchical Aggregation and Reduction Protocol), which offloads collective communications (from AI training applications) from CPUs and GPUs directly onto the Quantum InfiniBand network. Some models also feature co-packaged optics, replacing pluggable transceivers with silicon photonics on the same die as the ASIC. This innovation provides 3.5x better power efficiency and 10x higher network resiliency.
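For context, the kind of traffic SHARP accelerates is the collective communication issued by distributed training frameworks. The hypothetical sketch below shows an all-reduce across GPU workers using PyTorch with the NCCL backend; whether the reduction is actually offloaded to the switches depends on the fabric and the SHARP/NCCL plugin configuration, which is not shown here.

```python
# Hypothetical sketch of the collective operation SHARP accelerates: an
# all-reduce across GPU workers using PyTorch's NCCL backend. Launching with
# torchrun (which sets RANK, WORLD_SIZE, MASTER_ADDR and MASTER_PORT) is assumed.

import torch
import torch.distributed as dist

def allreduce_example():
    dist.init_process_group(backend="nccl")
    local_rank = dist.get_rank() % torch.cuda.device_count()
    tensor = torch.ones(1024, device=f"cuda:{local_rank}")

    # Every worker contributes its tensor and receives the element-wise sum.
    dist.all_reduce(tensor, op=dist.ReduceOp.SUM)
    print(f"rank {dist.get_rank()}: element value {tensor[0].item()}")
    dist.destroy_process_group()

if __name__ == "__main__":
    allreduce_example()
```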

NVIDIA Quantum-X800 Q3400-RA InfiniBand switch

NVIDIA Quantum-X Photonics InfiniBand switch

Model | Switch Backbone | Switch Ports | Performance | OS & Management | Cooling | Secure Boot
Q3200-RA | 72x XDR, up to 28.8Tb/s | 36x OSFP | 72x XDR 800G | UFM External | Air
Q3400-RA | 144x XDR, up to 115.2Tb/s | 72x OSFP | 144x XDR 800G | UFM External | Air
Q3401-RD | 144x XDR, up to 115.2Tb/s | 72x OSFP | 144x XDR 800G | UFM External | Air
Q3450-LD | 144x XDR, up to 115.2Tb/s | 144x MPO | 144x XDR 800G | UFM External | Liquid / Air

OS and Management

All Quantum X800 switches are externally managed via NVIDIA Unified Fabric Manager (UFM). This platform revolutionises datacentre network management by combining enhanced, real-time network telemetry with AI-powered cyber intelligence and analytics to support scale-out, InfiniBand-connected infrastructures. As UFM is also included in the software stack of NVIDIA DGX SuperPOD solutions, it is designed as a single universal management layer for your entire GPU-accelerated cluster.
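UFM also exposes a REST API that can be scripted against. The fragment below is a hypothetical illustration of polling switch inventory; the hostname, credentials, endpoint path and response field names are placeholders and should be checked against the UFM REST API documentation for your release.

```python
# Hypothetical UFM REST query. Endpoint path, credentials and response field
# names are assumptions to be verified against the UFM documentation.

import requests

UFM_HOST = "https://ufm.example.local"       # placeholder UFM appliance address
ENDPOINT = "/ufmRest/resources/systems"      # assumed inventory endpoint

response = requests.get(
    UFM_HOST + ENDPOINT,
    auth=("admin", "password"),              # placeholder credentials
    verify=False,                            # lab-only: skip TLS verification
    timeout=10,
)
response.raise_for_status()

for system in response.json():
    print(system.get("system_name"), system.get("state"))
```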

Compatible Network Cards

NVIDIA ConnectX-8 Super NIC

ConnectX-8 Super NIC

XDR 800G

ConnectX-8 Super NICs are specialised cards optimised to accelerate network traffic for AI workloads. They are built around a high-performance network ASIC, making them more streamlined and energy efficient than a DPU, prioritising GPU-to-GPU communication while minimising CPU overhead and latency.

NVIDIA ConnectX-9 Super NIC

ConnectX-9 Super NIC

GDR 1600G

ConnectX-9 Super NICs are the next generation of specialised cards optimised to accelerate network traffic for AI workloads. Built around a high-performance network ASIC, they will offer speeds of up to 1,600Gb/s when paired with Rubin-based GPU-accelerated servers - expected to be available in late 2026.

NVIDIA BlueField-3 DPU

BlueField-3 DPU

NDR 400G

BlueField-3 DPUs offload, accelerate, and isolate software-defined networking, storage, security and management functions. This significantly enhances datacentre performance and efficiency, while also creating a secure, zero-trust environment that streamlines operations and reduces the total cost of ownership.

NVIDIA BlueField-4 DPU

BlueField-4 DPU

XDR 800G

BlueField-4 DPUs offload, accelerate, and isolate software-defined networking, storage, security and management functions in the same way as their BlueField-3 predecessors, but offer speeds up to 800Gb/s due to the latest PCIe Gen6 bus.

You can learn more about the different types of NVIDIA network cards and their capabilities by watching our EXPLAINER VIDEO. A comparative summary is shown in the table below.

  Smart NIC Super NIC DPU
Network Protocols InfiniBand / Ethernet InfiniBand / Ethernet InfiniBand / Ethernet
Network Speeds 100-400Gb/s 100-1600Gb/s 200-800Gb/s
CPU Offloading for Network Functions
Memory Controllers
GPU to GPU Prioritisation
Network Accelerators
Storage Accelerators
Crypto Accelerators
Power Consumption Low Medium High
Cost ££ £££ ££££
NVIDIA Direct Attach Copper Cable

Direct Attach Copper Cable

NVIDIA Active Optical Cables

Active Optical Cables

NVIDIA Multi-Fibre Push-On Cable

Multi-Fibre Push-On Cable

NVIDIA LinkX interconnects are designed to link upward in Quantum switching architectures for switch-to-switch connections, and downward from top-of-rack switches to ConnectX Super NICs or BlueField DPUs in compute servers and storage systems. Direct Attach Copper (DAC) cables and Active Optical Cables (AOCs) use 100G-PAM4 modulation to provide the highest-throughput, lowest-latency connections, and are available with QSFP or OSFP transceiver module terminations. Multi-fibre Push-On (MPO) cables are also available for use with silicon photonics switches.

Managed Services

The switches and network cards discussed in this guide are just one element of the comprehensive infrastructure required to realise high-performance models such as LLMs and generative, agentic and physical AI. GPU-accelerated servers, AI-optimised storage, software applications, frameworks and hosting offerings all form part of a wider ecosystem.

As an NVIDIA Elite Partner and the UK's only NVIDIA-certified DGX Managed Services Partner, Scan is ideally placed to help guide your AI journey. From educational courses to proof-of-concept, through system configuration and network design, to deployment, installation and monitoring, Scan's system architects and data scientists are here to be your trusted advisor at every stage.

NVIDIA AI Ecosystem

Ready to Buy?

Browse our range of NVIDIA Networking products:

NVIDIA Quantum X800 Switches

Quantum-2 Switches

NVIDIA Quantum-2 switches are purpose-built for training and deploying LLMs and generative AI models, delivering 400Gb/s of end-to-end connectivity with ultra-low latency. These switches feature in-network compute acceleration technologies such as NVIDIA SHARP v3 (Scalable Hierarchical Aggregation and Reduction Protocol), which offloads collective communications (from AI training applications) from CPUs and GPUs directly onto the Quantum InfiniBand network.

NVIDIA Quantum InfiniBand switch

NVIDIA Quantum-2 switches are available in the following models.

Model | Switch Backbone | Switch Ports | Performance | OS & Management | Airflow | Secure Boot option
QM9700 | 64x NDR, up to 51.2Tb/s throughput | 32x OSFP | 64x NDR 400G or 128x HDR 200G | MLNX-OS Internal | Forward (P2C) or Reverse (C2P)
QM9790 | 64x NDR, up to 51.2Tb/s throughput | 32x OSFP | 64x NDR 400G or 128x HDR 200G | UFM External | Forward (P2C) or Reverse (C2P)

OS and Management

QM9700 switches come configured with MLNX-OS, a switch operating system built specifically for datacentres with high-performance computing and cloud fabrics, coupled to enterprise storage. Building networks with MLNX-OS enables scaling to thousands of compute and storage nodes with monitoring and provisioning capabilities, whether they are InfiniBand or Virtual Protocol Interconnect (VPI).

QM9790 switches are designed to be controlled with NVIDIA Unified Fabric Manager (UFM). This platform revolutionises datacentre network management by combining enhanced, real-time network telemetry with AI-powered cyber intelligence and analytics to support scale-out, InfiniBand-connected infrastructures. As UFM is also included in the software stack of NVIDIA DGX SuperPOD solutions, it is designed as a single universal management layer for your entire GPU-accelerated cluster.

Compatible Network Cards

NVIDIA ConnectX-6 Smart NIC

ConnectX-6 Smart NIC

HDR 200G / EDR 100G

ConnectX-6 Smart NICs deliver speeds of up to 200Gb/s, achieved by employing RDMA or RoCE to remove the latency usually introduced by the CPU, system memory and operating system.

NVIDIA ConnectX-7 Smart NIC

ConnectX-7 Smart NIC

NDR 400G / HDR 200G

ConnectX-7 Smart NICs deliver speeds of up to 400Gb/s, achieved by employing RDMA or RoCE to remove the latency usually introduced by the CPU, system memory and operating system.

NVIDIA BlueField-3 Super NIC

BlueField-3 Super NIC

NDR 400G / HDR 200G

BlueField-3 SuperNICs are advanced network accelerators, purpose-built for network-intensive, massively parallel computing. They combine RDMA or RoCE with GPUDirect technology to optimise peak AI workload efficiency.

NVIDIA BlueField-3 DPU

BlueField-3 DPU

NDR 400G / HDR 200G

BlueField-3 DPUs offload, accelerate, and isolate software-defined networking, storage and security functions. This significantly enhances datacentre performance and efficiency and creates a secure, zero-trust environment that streamlines operations.

You can learn more about the different types of NVIDIA network cards and their capabilities by watching our EXPLAINER VIDEO. A comparative summary is shown in the table below.

  Smart NIC Super NIC DPU
Network Protocols InfiniBand / Ethernet InfiniBand / Ethernet InfiniBand / Ethernet
Network Speeds 100-400Gb/s 100-1600Gb/s 200-800Gb/s
CPU Offloading for Network Functions
Memory Controllers
GPU to GPU Prioritisation
Network Accelerators
Storage Accelerators
Crypto Accelerators
Power Consumption Low Medium High
Cost ££ £££ ££££

Interconnects

NVIDIA Direct Attach Copper Cable

Direct Attach Copper Cable

NVIDIA Active Optical Cables

Active Optical Cables

NVIDIA LinkX interconnects are designed to link upward in Quantum switching architectures for switch-to-switch connections, and downward from top-of-rack switches to ConnectX Super NICs or BlueField DPUs in compute servers and storage systems. Direct Attach Copper (DAC) cables and Active Optical Cables (AOCs) use 100G-PAM4 modulation to provide the highest-throughput, lowest-latency connections, and are available with QSFP or OSFP transceiver module terminations.

Managed Services

The switches and network cards discussed in this guide are just one element of the comprehensive infrastructure required to realise high-performance models such as LLMs and generative, agentic and physical AI. GPU-accelerated servers, AI-optimised storage, software applications, frameworks and hosting offerings all form part of a wider ecosystem.

As an NVIDIA Elite Partner and the UK's only NVIDIA-certified DGX Managed Services Partner, Scan is ideally placed to help guide your AI journey. From educational courses to proof-of-concept, through system configuration and network design, to deployment, installation and monitoring, Scan's system architects and data scientists are here to be your trusted advisor at every stage.

NVIDIA AI Ecosystem

Ready to Buy?

Browse our range of NVIDIA Networking products:

NVIDIA Quantum-2 Switches

Quantum Switches

NVIDIA Quantum switches are purpose-built for training and deploying advanced AI models, delivering 200Gb/s of end-to-end connectivity with low latency. These switches feature in-network compute acceleration technologies such as NVIDIA SHARP v2 (Scalable Hierarchical Aggregation and Reduction Protocol), which offloads collective communications (from AI training applications) from CPUs and GPUs directly onto the Quantum InfiniBand network.

NVIDIA Quantum InfiniBand switch

NVIDIA Quantum switches are available in the following models.

Model | Switch Backbone | Switch Ports | Performance | OS & Management | Airflow | Secure Boot option
QM8700 | 40x HDR, up to 16Tb/s throughput | 40x QSFP56 | 40x HDR 200G or 80x EDR 100G | MLNX-OS Internal | Forward (P2C) or Reverse (C2P)
QM8790 | 40x HDR, up to 16Tb/s throughput | 40x QSFP56 | 40x HDR 200G or 80x EDR 100G | UFM External | Forward (P2C) or Reverse (C2P)

OS and Management

QM8700 switches come configured with MLNX-OS, a switch operating system built specifically for datacentres with high-performance computing and cloud fabrics, coupled to enterprise storage. Building networks with MLNX-OS enables scaling to thousands of compute and storage nodes with monitoring and provisioning capabilities, whether they are InfiniBand or Virtual Protocol Interconnect (VPI).

QM8790 switches are designed to be controlled with NVIDIA Unified Fabric Manager (UFM). This platform revolutionises datacentre network management by combining enhanced, real-time network telemetry with AI-powered cyber intelligence and analytics to support scale-out, InfiniBand-connected infrastructures. As UFM is also included in the software stack of NVIDIA DGX SuperPOD solutions, it is designed as a single universal management layer for your entire GPU-accelerated cluster.

Compatible Network Cards

NVIDIA ConnectX-6 Smart NIC

ConnectX-6 Smart NIC

HDR 200G / EDR 100G

ConnectX-6 Smart NICs deliver speeds of up to 200Gb/s, achieved by employing RDMA or RoCE to remove the latency usually introduced by the CPU, system memory and operating system.

NVIDIA ConnectX-7 Smart NIC

ConnectX-7 Smart NIC

NDR 400G / HDR 200G

ConnectX-7 Smart NICs deliver speeds of up to 400Gb/s, achieved by employing RDMA or RoCE to remove the latency usually introduced by the CPU, system memory and operating system.

NVIDIA BlueField-3 Super NIC

BlueField-3 Super NIC

NDR 400G / HDR 200G

BlueField-3 SuperNICs are advanced network accelerators, purpose-built for network-intensive, massively parallel computing. They combine RDMA or RoCE with GPUDirect technology to optimise peak AI workload efficiency.

NVIDIA BlueField-3 DPU

BlueField-3 DPU

NDR 400G / HDR 200G

BlueField-3 DPUs offload, accelerate, and isolate software-defined networking, storage and security functions. This significantly enhances datacentre performance and efficiency and creates a secure, zero-trust environment that streamlines operations.

You can learn more about the different types of NVIDIA network cards and their capabilities by watching our EXPLAINER VIDEO. A comparative summary is shown in the table below.

  Smart NIC Super NIC DPU
Network Protocols InfiniBand / Ethernet InfiniBand / Ethernet InfiniBand / Ethernet
Network Speeds 100-400Gb/s 100-1600Gb/s 200-800Gb/s
CPU Offloading for Network Functions
Memory Controllers
GPU to GPU Prioritisation
Network Accelerators
Storage Accelerators
Crypto Accelerators
Power Consumption Low Medium High
Cost ££ £££ ££££

Interconnects

NVIDIA Direct Attach Copper Cable

Direct Attach Copper Cable

NVIDIA Active Optical Cables

Active Optical Cables

NVIDIA LinkX interconnects are designed to link upward in Quantum switching architectures for switch-to-switch connections, and downward from top-of-rack switches to ConnectX Super NICs or BlueField DPUs in compute servers and storage systems. Direct Attach Copper (DAC) cables and Active Optical Cables (AOCs) use 100G-PAM4 modulation to provide the highest-throughput, lowest-latency connections, and are available with QSFP or OSFP transceiver module terminations.

Managed Services

The switches and network cards discussed in this guide are just one element of the comprehensive infrastructure required to realise high-performance models such as LLMs and generative, agentic and physical AI. GPU-accelerated servers, AI-optimised storage, software applications, frameworks and hosting offerings all form part of a wider ecosystem.

As an NVIDIA Elite Partner and the UK's only NVIDIA-certified DGX Managed Services Partner, Scan is ideally placed to help guide your AI journey. From educational courses to proof-of-concept, through system configuration and network design, to deployment, installation and monitoring, Scan's system architects and data scientists are here to be your trusted advisor at every stage.

NVIDIA AI Ecosystem

Ready to Buy?

Browse our range of NVIDIA Networking products:

NVIDIA Quantum Switches

NVIDIA Spectrum Ethernet switches are available in various generations, defined by increasing performance and features. They are complemented by NVIDIA ConnectX Smart NICs, Super NICs and BlueField DPUs. Click the tabs below to explore the range.

Spectrum-6 Switches

NVIDIA Spectrum-6 switches are purpose-built for trillion-parameter-scale agentic and physical AI models, delivering 800Gb/s of end-to-end connectivity with ultra-low latency. These switches feature advanced technologies including multi-chassis link aggregation group (MLAG) for active/active L2 multi-pathing and 256-way equal-cost multi-path (ECMP) routing for load balancing and redundancy. They also feature co-packaged optics, replacing pluggable transceivers with silicon photonics on the same die as the ASIC. This innovation provides 3.5x better power efficiency and 10x higher network resiliency.

NVIDIA Spectrum-6 Switches

NVIDIA Spectrum-6 switches are available in the following models.

Model | Switch Backbone | Switch Ports | Performance | OS & Management | Cooling | Secure Boot option
SN6800-LD | Up to 409Tb/s throughput | 512x MPO | 512x 800GbE or 2,048x 200GbE | Cumulus Linux Internal / External | Liquid / Air
SN6810-LD | Up to 102Tb/s throughput | 128x MPO | 128x 800GbE or 256x 200GbE | Cumulus Linux Internal / External | Air
SN6600-LD | Up to 102Tb/s throughput | 64x OSFP | 64x 800GbE or 256x 200GbE | Cumulus Linux Internal / External | Air

OS and Management

All models run on pre-configured Cumulus Linux providing standard networking functions such as bridging and routing, plus advanced features including automation, orchestration, monitoring and analytics. It also includes NVIDIA Air - a tool that makes physical deployments seamless by validating and simplifying deployments and upgrades in a digital twin virtual network environment.

The switches can also be configured with alternative OS software such as SONiC. Its containerised design makes it flexible and customisable, allowing customers to combine and manage SONiC and non-SONiC switches within the same networking fabric. NVIDIA's SONiC offering removes distribution limitations and enables enterprises to take full advantage of the benefits of open networking, while adding the NVIDIA expertise and support that best guarantee success.
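As a flavour of day-to-day administration, the sketch below drives Cumulus Linux's NVUE command line from Python to stage and apply a simple port configuration. The interface names, MTU value and exact command strings are illustrative and should be verified against the NVUE command reference for your Cumulus Linux release.

```python
# Illustrative use of the NVUE CLI on a Cumulus Linux switch, driven from Python.
# Interface names and command arguments are examples only; check them against
# the NVUE reference for your release before running on a production switch.

import subprocess

UPLINKS = ["swp1", "swp2", "swp3", "swp4"]   # example uplink ports

for port in UPLINKS:
    subprocess.run(["nv", "set", "interface", port, "link", "mtu", "9216"], check=True)
    subprocess.run(["nv", "set", "interface", port, "link", "state", "up"], check=True)

# Staged changes only take effect once applied.
subprocess.run(["nv", "config", "apply"], check=True)
```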

Compatible Network Cards

NVIDIA ConnectX-8 Super NIC

ConnectX-8 Super NIC

800GbE

ConnectX-8 Super NICs are specialised cards optimised to accelerate network traffic for AI workloads. They are built around a high-performance network ASIC, making them more streamlined and energy efficient than a DPU, prioritising GPU-to-GPU communication while minimising CPU overhead and latency.

NVIDIA ConnectX-9 Super NIC

ConnectX-9 Super NIC

1600GbE

ConnectX-9 Super NICs are the next generation of specialised cards optimised to accelerate network traffic for AI workloads. Built around a high-performance network ASIC, they will offer speeds of up to 1,600Gb/s when paired with Rubin-based GPU-accelerated servers - expected to be available in late 2026.

NVIDIA BlueField-3 DPU

BlueField-3 DPU

400GbE

BlueField-3 DPUs offload, accelerate, and isolate software-defined networking, storage, security and management functions. This significantly enhances datacentre performance and efficiency, whilst also creating a secure, zero-trust environment that streamlines operations and reduces the total cost of ownership.

NVIDIA BlueField-4 DPU

BlueField-4 DPU

800GbE

BlueField-4 DPUs offload, accelerate, and isolate software-defined networking, storage, security and management functions in the same way as their BlueField-3 predecessors, but offer speeds up to 800Gb/s due to the latest PCIe 6 bus.

You can learn more about the different types of NVIDIA network cards and their capabilities by watching our EXPLAINER VIDEO. A comparative summary is shown in the table below.

  Smart NIC Super NIC DPU
Network Protocols InfiniBand / Ethernet InfiniBand / Ethernet InfiniBand / Ethernet
Network Speeds 100-400Gb/s 100-1600Gb/s 200-800Gb/s
CPU Offloading for Network Functions
Memory Controllers
GPU to GPU Prioritisation
Network Accelerators
Storage Accelerators
Crypto Accelerators
Power Consumption Low Medium High
Cost ££ £££ ££££

Interconnects

NVIDIA Direct Attach Copper Cable

Direct Attach Copper Cable

NVIDIA Active Optical Cables

Active Optical Cables

NVIDIA Multi-Fibre Push-On Cable

Multi-Fibre Push-On Cable

NVIDIA LinkX interconnects are designed to link upward in Spectrum switching architectures for switch-to-switch connections, and downward from top-of-rack switches to ConnectX Super NICs or BlueField DPUs in compute servers and storage systems. Direct Attach Copper (DAC) cables and Active Optical Cables (AOCs) use 100G-PAM4 modulation to provide the highest-throughput, lowest-latency connections, and are available with QSFP or OSFP transceiver module terminations. Multi-fibre Push-On (MPO) cables are also available for use with silicon photonics switches.

Managed Services

The switches and network cards discussed in this guide are just one element of the comprehensive infrastructure required to realise high-performance models such as LLMs and generative, agentic and physical AI. GPU-accelerated servers, AI-optimised storage, software applications, frameworks and hosting offerings all form part of a wider ecosystem.

As an NVIDIA Elite Partner and the UK's only NVIDIA-certified DGX Managed Services Partner, Scan is ideally placed to help guide your AI journey. From educational courses to proof-of-concept, through system configuration and network design, to deployment, installation and monitoring, Scan's system architects and data scientists are here to be your trusted advisor at every stage.

NVIDIA AI Ecosystem

Ready to Buy?

Coming Soon

NVIDIA Spectrum-6 Switches

Spectrum-4 Switches

NVIDIA Spectrum-4 switches are purpose-built for trillion-parameter-scale agentic and physical AI models, delivering 800Gb/s of end-to-end connectivity with ultra-low latency. These switches feature advanced technologies including multi-chassis link aggregation group (MLAG) for active/active L2 multi-pathing and 256-way equal-cost multi-path (ECMP) routing for load balancing and redundancy.

NVIDIA Spectrum-4 Switches

NVIDIA Spectrum-4 switches are available in the following models.

Model | Switch Backbone | Switch Ports | Performance | OS & Management | Airflow | Secure Boot option
SN5610 | 64x 800GbE, up to 51.2Tb/s throughput | 64x OSFP | 64x 800GbE or 128x 400GbE or 256x 200GbE | Cumulus Linux Internal / External | Reverse (C2P)
SN5600 | 64x 800GbE, up to 51.2Tb/s throughput | 64x OSFP | 64x 800GbE or 128x 400GbE or 256x 200GbE | Cumulus Linux Internal / External | Reverse (C2P)
SN5600D | 64x 800GbE, up to 51.2Tb/s throughput | 64x OSFP | 64x 800GbE or 128x 400GbE or 256x 200GbE | Cumulus Linux Internal / External | Reverse (C2P)
SN5400 | 64x 400GbE, up to 25.6Tb/s throughput | 64x QSFP56-DD | 64x 400GbE or 128x 200GbE or 256x 100GbE | Cumulus Linux Internal / External | Forward (P2C) or Reverse (C2P)

OS and Management

All models run on pre-configured Cumulus Linux providing standard networking functions such as bridging and routing, plus advanced features including automation, orchestration, monitoring and analytics. It also includes NVIDIA Air - a tool that makes physical deployments seamless by validating and simplifying deployments and upgrades in a digital twin virtual network environment.

The switches can also be configured with alternative OS software such as SONiC. Its containerised design makes it flexible and customisable, allowing customers to combine and manage SONiC and non-SONiC switches within the same networking fabric. NVIDIA's SONiC offering removes distribution limitations and enables enterprises to take full advantage of the benefits of open networking, while adding the NVIDIA expertise and support that best guarantee success.

Compatible Network Cards

NVIDIA ConnectX-8 Super NIC

ConnectX-8 Super NIC

800GbE

ConnectX-8 Super NICs are specialised cards optimised to accelerate network traffic for AI workloads. They are built around a high-performance network ASIC, making them more streamlined and energy efficient than a DPU, prioritising GPU-to-GPU communication while minimising CPU overhead and latency.

NVIDIA ConnectX-9 Super NIC

ConnectX-9 Super NIC

1600GbE

ConnectX-9 Super NICs are the next generation of specialised cards optimised to accelerate network traffic for AI workloads. Built around a high-performance network ASIC, they will offer speeds of up to 1,600Gb/s when paired with Rubin-based GPU-accelerated servers - expected to be available in late 2026.

NVIDIA BlueField-3 DPU

BlueField-3 DPU

400GbE

BlueField-3 DPUs offload, accelerate, and isolate software-defined networking, storage, security and management functions. This significantly enhances datacentre performance and efficiency, whilst also creating a secure, zero-trust environment that streamlines operations and reduces the total cost of ownership.

NVIDIA BlueField-4 DPU

BlueField-4 DPU

800GbE

BlueField-4 DPUs offload, accelerate, and isolate software-defined networking, storage, security and management functions in the same way as their BlueField-3 predecessors, but offer speeds up to 800Gb/s due to the latest PCIe 6 bus.

You can learn more about the different types of NVIDIA network cards and their capabilities by watching our EXPLAINER VIDEO. A comparative summary is shown in the table below.

  Smart NIC Super NIC DPU
Network Protocols InfiniBand / Ethernet InfiniBand / Ethernet InfiniBand / Ethernet
Network Speeds 100-400Gb/s 100-1600Gb/s 200-800Gb/s
CPU Offloading for Network Functions
Memory Controllers
GPU to GPU Prioritisation
Network Accelerators
Storage Accelerators
Crypto Accelerators
Power Consumption Low Medium High
Cost ££ £££ ££££

Interconnects

NVIDIA Direct Attach Copper Cable

Direct Attach Copper Cable

NVIDIA Active Optical Cables

Active Optical Cables

NVIDIA LinkX interconnects are designed to link upward in Spectrum switching architectures for switch-to-switch connections, and downward from top-of-rack switches to ConnectX Super NICs or BlueField DPUs in compute servers and storage systems. Direct Attach Copper (DAC) cables and Active Optical Cables (AOCs) use 100G-PAM4 modulation to provide the highest-throughput, lowest-latency connections, and are available with QSFP or OSFP transceiver module terminations.

Managed Services

The switches and network cards discussed in this guide are just one element of the comprehensive infrastructure required to realise high-performance models such as LLMs and generative, agentic and physical AI. GPU-accelerated servers, AI-optimised storage, software applications, frameworks and hosting offerings all form part of a wider ecosystem.

As an NVIDIA Elite Partner and the UK's only NVIDIA-certified DGX Managed Services Partner, Scan is ideally placed to help guide your AI journey. From educational courses to proof-of-concept, through system configuration and network design, to deployment, installation and monitoring, Scan's system architects and data scientists are here to be your trusted advisor at every stage.

NVIDIA AI Ecosystem

Ready to Buy?

Browse our range of NVIDIA Networking products:

NVIDIA Spectrum-4 Switches

Spectrum-3 Switches

NVIDIA Spectrum-3 switches are purpose-built for LLMs and generative AI models, delivering 400Gb/s of end-to-end connectivity with ultra-low latency. These switches feature advanced technologies including multi-chassis link aggregation group (MLAG) for active/active L2 multi-pathing and 128-way equal-cost multi-path (ECMP) routing for load balancing and redundancy.

NVIDIA Spectrum-3 Switches

NVIDIA Spectrum-3 switches are available in the following models.

Model | Switch Backbone | Switch Ports | Performance | OS & Management | Airflow | Secure Boot option
SN4700 | 32x 400GbE, up to 12.8Tb/s throughput | 32x QSFP56-DD | 32x 400GbE or 64x 200GbE or 128x 100GbE | Cumulus Linux Internal / External | Forward (P2C) or Reverse (C2P)
SN4600C | 64x 100GbE, up to 6.4Tb/s throughput | 64x QSFP28 | 64x 100GbE or 128x 50GbE | Cumulus Linux Internal / External | Forward (P2C) or Reverse (C2P)

OS and Management

All models run on pre-configured Cumulus Linux providing standard networking functions such as bridging and routing, plus advanced features including automation, orchestration, monitoring and analytics. It also includes NVIDIA Air - a tool that makes physical deployments seamless by validating and simplifying deployments and upgrades in a digital twin virtual network environment.

The switches can also be configured with alternative OS software such as SONiC. Its containerised design makes it flexible and customisable, allowing customers to combine and manage SONiC and non-SONiC switches within the same networking fabric. NVIDIA's SONiC offering removes distribution limitations and enables enterprises to take full advantage of the benefits of open networking, while adding the NVIDIA expertise and support that best guarantee success.

Compatible Network Cards

NVIDIA ConnectX-6 Smart NIC

ConnectX-6 Smart NIC

200GbE / 100GbE / 50GbE

ConnectX-6 Smart NICs deliver speeds of up to 200Gb/s, achieved by employing RDMA or RoCE to remove the latency usually introduced by the CPU, system memory and operating system.

NVIDIA ConnectX-7 Smart NIC

ConnectX-7 Smart NIC

400GbE / 200GbE / 100GbE

ConnectX-7 Smart NICs deliver speeds of up to 400Gb/s, achieved by employing RDMA or RoCE to remove the latency usually introduced by the CPU, system memory and operating system.

NVIDIA BlueField-3 Super NIC

BlueField-3 Super NIC

400GbE / 200GbE / 100GbE

BlueField-3 SuperNICs are advanced network accelerators, purpose-built for network-intensive, massively parallel computing. They combine RDMA or RoCE with GPUDirect technology to optimise peak AI workload efficiency.

NVIDIA BlueField-3 DPU

BlueField-3 DPU

400GbE / 200GbE

BlueField-3 DPUs offload, accelerate, and isolate software-defined networking, storage and security functions. This significantly enhances datacentre performance and efficiency and creates a secure, zero-trust environment that streamlines operations.

You can learn more about the different types of NVIDIA network cards and their capabilities by watching our EXPLAINER VIDEO. A comparative summary is shown in the table below.

  Smart NIC Super NIC DPU
Network Protocols InfiniBand / Ethernet InfiniBand / Ethernet InfiniBand / Ethernet
Network Speeds 100-400Gb/s 100-1600Gb/s 200-800Gb/s
CPU Offloading for Network Functions
Memory Controllers
GPU to GPU Prioritisation
Network Accelerators
Storage Accelerators
Crypto Accelerators
Power Consumption Low Medium High
Cost ££ £££ ££££

Interconnects

NVIDIA Direct Attach Copper Cable

Direct Attach Copper Cable

NVIDIA Active Optical Cables

Active Optical Cables

NVIDIA LinkX interconnects are designed to link upward in Spectrum switching architectures for switch-to-switch connections, and downward from top-of-rack switches to ConnectX Super NICs or BlueField DPUs in compute servers and storage systems. Direct Attach Copper (DAC) cables and Active Optical Cables (AOCs) use 100G-PAM4 modulation to provide the highest-throughput, lowest-latency connections, and are available with QSFP or OSFP transceiver module terminations.

Managed Services

The switches and network cards discussed in this guide are just one element of the comprehensive infrastructure required to realise high-performance models such as LLMs and generative, agentic and physical AI. GPU-accelerated servers, AI-optimised storage, software applications, frameworks and hosting offerings all form part of a wider ecosystem.

As an NVIDIA Elite Partner and the UK's only NVIDIA-certified DGX Managed Services Partner, Scan is ideally placed to help guide your AI journey. From educational courses to proof-of-concept, through system configuration and network design, to deployment, installation and monitoring, Scan's system architects and data scientists are here to be your trusted advisor at every stage.

NVIDIA AI Ecosystem

Ready to Buy?

Browse our range of NVIDIA Networking products:

NVIDIA Spectrum-3 Switches

Need Help with NVIDIA Networking?

If you have any further questions you'd like answering about NVIDIA networking solutions for your business or organisation, don't hesitate to contact one of our friendly advisors on

Frequently Asked Questions (FAQ)

NVIDIA Networking is a sub-brand of NVIDIA that evolved from its purchase of Mellanox Technologies in 2020. It is an umbrella term that covers all of its switch, NIC and interconnect products.

A network is two or more connected computer devices that communicate and share resources. At large scale, such as in traditional datacentres, a network usually has three layers - access (connecting to desktops and laptops), aggregation (connecting access switches to the network core) and core (connecting to centralised resources such as servers and storage).

A spine-leaf network is an alternative architecture to the traditional network described in the last question, where the three access / aggregation / core layers are replaced by just two layers - spine / leaf. This simplified structure offers lower latency, better performance and simplified scalability.

Ethernet is a wired networking protocol that connects devices on a local network (LAN) using physical cables. It is used at every scale from home networks to large corporate organisations.

InfiniBand is an alternative wired networking protocol designed specifically for high-performance computing (HPC), datacentres, and AI clusters. It prioritises very high throughput and low latency.

A network switch is a hardware device that connects multiple devices, such as computers, printers, and servers, on a wired local area network (LAN).

A network interface card, or NIC, is a hardware device that is fitted within a PC, workstation, server or storage appliance to connect it to a network switch, in order to share resources such as files, printers, and servers, on a wired local area network (LAN).

A Smart NIC delivers all the connectivity of a standard NIC, but employs remote direct memory access (RDMA) or RDMA over Converged Ethernet (RoCE) to remove the latency usually introduced by the CPU, system memory and operating system. Learn more by watching our EXPLAINER VIDEO.

A Super NIC is a specialised card optimised to accelerate network traffic for AI workloads. It is built around a high-performance network ASIC, making it a more streamlined and less computationally intensive solution than a DPU, so GPU-to-GPU communication is prioritised whilst minimising CPU overhead and latency. Learn more by watching our EXPLAINER VIDEO.

A DPU (data processing unit) is a specialised network card designed to offload, accelerate, and isolate software-defined networking, storage and security functions. This significantly enhances datacentre performance and efficiency and creates a secure, zero-trust environment that streamlines operations. Learn more by watching our EXPLAINER VIDEO.

RDMA, or Remote Direct Memory Access, is a technology that allows data to be transferred directly between the memory of two computers - via an InfiniBand NIC - without involving the operating system or CPU on either end.

RoCE (pronounced rocky), or RDMA over Converged Ethernet, is a technology that allows data to be transferred directly between the memory of two computers - via an Ethernet NIC - without involving the operating system or CPU on either end.

A network transceiver is a device that combines a transmitter and a receiver to both send and receive data signals over a network connection. They are essential for network equipment such as switches and routers, converting signals between electrical and optical or copper formats.

A small form-factor pluggable (SFP) module is a type of transceiver that transfers data in a single channel over copper or optical fibre cabling from a network switch to a server or storage device.

A quad small form-factor pluggable (QSFP) module is a type of transceiver that transfers data over four channels over copper or optical fibre cabling from a network switch to a server or storage device.

An octal small form-factor pluggable (OSFP) module is a type of transceiver that transfers data over eight channels over copper or optical fibre cabling from a network switch to a server or storage device.

Multi-fibre push-on (MPO) is a type of connector that removes the need for SFP transceivers. MPO connectors are used when the photonics elements of the fibre optics are integrated into the ASIC die in very high-end switches.

Power-to-Connector (P2C) and Connector-to-Power (C2P) are airflow standards through the switch chassis. Also known as 'Forward' and 'Reverse' respectively, they describe how the cooling fans within a switch are configured, depending on its mounting in a datacentre rack cabinet.

The management of an NVIDIA network switch is the process of configuring, monitoring, and maintaining the switch to manage data flow, enhance security, and facilitate efficient communication between devices such as GPU-accelerated servers and storage arrays. This can be done via an inbuilt operating system (OS) or through external means or a third-party OS.

Equal Cost Multi-Path (ECMP) is a routing strategy that allows a network device to forward traffic to a single destination over multiple paths of equal cost. It enables load balancing, which increases bandwidth utilisation and improves network reliability by distributing traffic across these equal-cost paths.
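A toy illustration of how ECMP typically picks a path: a hash of the flow's addresses and ports selects one of the equal-cost next hops, so packets from the same flow stay in order on one path while different flows spread across the fabric. The hash function and field choice below are simplified assumptions rather than the behaviour of any particular switch implementation.

```python
# Toy ECMP next-hop selection: hash the flow 5-tuple and pick one of N paths.

import hashlib

def ecmp_path(src_ip, dst_ip, src_port, dst_port, protocol, num_paths):
    flow = f"{src_ip}:{src_port}-{dst_ip}:{dst_port}/{protocol}".encode()
    digest = hashlib.sha256(flow).digest()
    return int.from_bytes(digest[:4], "big") % num_paths

# Two different flows between the same hosts may land on different spine paths.
print(ecmp_path("10.0.0.1", "10.0.1.1", 40000, 4791, "udp", num_paths=4))
print(ecmp_path("10.0.0.1", "10.0.1.1", 40001, 4791, "udp", num_paths=4))
```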

Multi-Chassis Link Aggregation (MLAG) is a network technology that allows two or more switches to act as a single logical switch for link aggregation, providing redundancy and increased bandwidth. If one switch fails, the other can take over the traffic, minimising downtime and making it ideal for datacentre environments.

Spanning Tree Protocol (STP) is a layer-2 network protocol that prevents loops and broadcast storms in local area networks (LANs) with redundant links. STP works by using an algorithm to build a single, loop-free logical topology, which it achieves by blocking redundant paths while keeping them available as backups. This ensures that there is only one active path between any two devices on the network.

PAM4, or Pulse Amplitude Modulation with 4 levels, is a signalling technique that transmits data using four different signal amplitude levels, allowing two bits of data to be encoded per symbol. This effectively doubles the data rate compared to traditional two-level signalling (PAM2, also known as Non-Return-to-Zero or NRZ).
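The arithmetic behind that doubling is shown below: at the same symbol (baud) rate, moving from two levels (1 bit per symbol) to four levels (2 bits per symbol) doubles the bits carried per lane. The baud rate used is purely illustrative.

```python
# Data rate = baud rate x bits per symbol, where bits per symbol = log2(levels).

import math

def data_rate_gbps(baud_gbaud: float, levels: int) -> float:
    return baud_gbaud * math.log2(levels)

print(data_rate_gbps(53.125, 2))   # NRZ/PAM2 at ~53 GBd gives ~53 Gb/s per lane
print(data_rate_gbps(53.125, 4))   # PAM4     at ~53 GBd gives ~106 Gb/s per lane
```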