Skip to content

Multiplanar Fabric AI Networking Primer

Reference architecture and design diagrams for MRC (Multipath Reliable Connection) networking — the OCP-standard approach to scaling AI training clusters over deterministic, multi-plane Ethernet fabrics.

MRC replaces traditional InfiniBand-style fat-tree designs with eight fully independent fabric planes, static SRv6 source routing, and NIC-driven path selection. The result is predictable, high-bandwidth inter-node communication for distributed training workloads without ECMP, dynamic routing, or application changes.


What is MRC?

Multipath Reliable Connection (MRC) is defined in OCP MRC Rev 1.0 and targets large-scale GPU clusters where collective operations (all-reduce, all-gather) dominate inter-node traffic.

Principle Description
Eight independent planes Each plane is a dedicated leaf–spine pair with no cross-plane links. Traffic on plane n never traverses plane m.
NIC-to-plane wiring NIC n on every node connects to the leaf switch of plane n. This discipline is the foundation of the entire design.
Static SRv6 routing Paths are encoded in an SRv6 uN uSID stack at the destination address. Switches perform deterministic uSID shifts — no BGP, no ECMP.
32-bit Entropy Value (EV) A per-packet identifier striped across the IPv6 flow label and UDP source port. EVs are generated at QP startup (128–256 per QP, split across planes) and echoed in SACK/NACK feedback for path health monitoring.
Lossy Ethernet RDMA PFC is disabled. The MRC engine in the SmartNIC handles retransmission and congestion response at the transport layer.

At the node level, intra-node traffic (NVLink on Nvidia, Infinity Fabric on AMD) handles local all-reduce, while inter-node traffic flows through MRC-capable SmartNICs over the eight-plane fabric.


Documentation map

Protocol & packet format

Understand how MRC encodes paths and entropy in every packet.

Node-level integration

How individual compute nodes connect GPUs to the eight-plane fabric.

Vendor-agnostic fabric

MRC is fabric-agnostic. Both Nvidia and AMD nodes implement the same OCP MRC Rev 1.0 semantics — identical EV generation, SRv6 forwarding, and plane wiring. An Arista 7060XE7 switch sees indistinguishable traffic from either vendor, making mixed AMD + Nvidia clusters architecturally valid.

Rack & data center deployments

End-to-end rack layouts pairing compute with MRC fabric switches.


Key design parameters

Parameter H100 generation Blackwell generation
NIC ConnectX-7 ConnectX-8 SuperNIC
Uplink speed 400G per plane 800G per plane
NICs per node 8 8
Fabric BW per node 3.2 Tbps 6.4 Tbps
Leaf/spine switch Arista 7060XE7 Arista 7060XE7
Planes 8 (independent) 8 (independent)
Routing SRv6 uN uSID (static) SRv6 uN uSID (static)
EVs per QP 128–256 128–256
ECMP / dynamic routing Disabled Disabled

Architecture at a glance

MRC architecture overview — compute node with eight SmartNICs wired to eight independent leaf-spine fabric planes

Each plane operates as an isolated 2-hop leaf–spine network. A 72-GPU rack typically pairs nine compute nodes with sixteen Arista 7060XE7 switches — one leaf and one spine per plane — delivering deterministic any-to-any connectivity for collective traffic.


Getting started

Browse the Pages section in the navigation sidebar for interactive SVG diagrams covering node topology, rack wiring, packet headers, and vendor comparisons. Each page is a self-contained reference diagram with legends, callouts, and design notes.


Resources