AMD Instinct MI350X — MRC + SRv6 Fabric Connectivity

AMD Pensando Pollara 400 NIC · 8× 400G per node · 8-plane MRC fabric · Arista 7060XE7 leaf+spine · OCP MRC Rev 1.0

AMD MI350X Node — Internal topology

AMD vs Nvidia — MRC node comparison

Feature	AMD MI350X	Nvidia B300
GPU	8× MI350X (CDNA4)	8× B300 (Blackwell)
Scale-up fabric	Infinity Fabric 1,075 GB/s	NVLink 5 1.8 TB/s
Scale-up silicon	On-package / PCIe switch	Dedicated NVSwitch ×4
Scale-out NIC	Pollara 400 400 Gbps	ConnectX-8 800 Gbps
NIC:GPU ratio	1:1 (8 NICs/node)	1:1 (8 NICs/node)
NIC host bus	PCIe Gen5 x16	PCIe Gen5 x16
MRC support	Yes P4 ASIC FW	Yes HW native
SRv6 uN SID	Yes	Yes
Packet spray	Yes (Pollara HW)	Yes (CX-8 HW)
Coll. lib.	RCCL + MRC shim	NCCL + MRC plugin
Fabric BW/node	3.2 Tbps	6.4 Tbps
Arista 7060XE7	Compatible	Compatible
Next-gen NIC	Vulcano 800G MI400, PCIe Gen6	CX-9 (future)
Key diff	PCIe switch connects GPUs + NICs. No dedicated NVSwitch-equivalent.	NVSwitch dedicated silicon for GPU mesh.

MRC is fabric-agnostic. Both implement OCP MRC Rev 1.0: 32-bit EV striped across UDP src port + IPv6 flow label; 128–256 EVs per QP split equally across planes; SRv6 uN SID static source routing; PFC disabled (lossy Ethernet); dynamic routing disabled. The Arista 7060XE7 sees identical traffic from both. Mixed AMD+Nvidia clusters on the same MRC fabric are architecturally valid.

MRC SRv6 fabric — AMD MI350X node → 8 planes → Arista 7060XE7

Pollara 400 — MRC capabilities

Intelligent packet spray — distributes packets across all 8 planes per NIC, 32-bit EV striped across UDP src port + IPv6 flow label · 128–256 EVs per QP, equally split across planes

In-order delivery — every packet carries RDMA virtual address + remote key → NIC writes directly to correct GPU memory location regardless of arrival order

Selective retransmission — SACK indicates precisely which packets arrived · selective retransmit only missing packets · packet trimming: payload stripped, header forwarded as loss signal

Path-aware congestion avoidance — ECN CE bit at leaf (not last hop) → CNP/NACK → Pollara swaps EV to cleaner plane · PFC disabled, runs lossy

Failover — packet loss (not trim) = path failure → EV retired immediately · background probes resurrect recovered paths

SW-programmable P4 ASIC — MRC transport runs in NIC firmware, updatable without hardware swap as spec evolves

SRv6 uN SID encoding — 32-bit EV maps algorithmically to SRv6 address template → specific switch path encoded in uN uSIDs · no dynamic routing in fabric

RCCL + MRC shim — OCP MRC Rev 1.0 ibverbs shim; RCCL workloads run over MRC with no source changes

Port config flexibility: Each Pollara 400 supports 1×400G, 2×200G, or 4×100G. In a multi-plane setup, a single NIC can cover 4 planes at 100G each, or 2 NICs cover all 8 planes at 100G — enabling MRC without a full 8-NIC configuration where cost is a concern.

Vulcano 800G (MI400 — upcoming)

PCIe Gen6 host interface — removes PCIe5 bandwidth bottleneck

800 Gbps per NIC — matches Nvidia CX-8

4×200G or 8×100G multi-plane configs

Up to 2.4 Tbps per GPU (3 NICs per GPU)

With 3× Vulcano NICs per MI400 GPU, AMD moves to a different plane-to-NIC mapping than Nvidia's 1:1. The Arista 7060XE7 fabric topology is unchanged — only the server-side cabling discipline differs.

8

Pollara NICs/node

3.2

Tbps fabric BW

8

MRC planes

400G

per uplink

P4

ASIC (SW prog.)

0

ECMP · no dyn. routing