AMD Instinct MI350X — MRC + SRv6 Fabric Connectivity
AMD Pensando Pollara 400 NIC · 8× 400G per node · 8-plane MRC fabric · Arista 7060XE7 leaf+spine · OCP MRC Rev 1.0
AMD MI350X Node — Internal topology
AMD vs Nvidia — MRC node comparison
| Feature | AMD MI350X | Nvidia B300 |
|---|---|---|
| GPU | 8× MI350X (CDNA4) | 8× B300 (Blackwell) |
| Scale-up fabric | Infinity Fabric 1,075 GB/s |
NVLink 5 1.8 TB/s |
| Scale-up silicon | On-package / PCIe switch | Dedicated NVSwitch ×4 |
| Scale-out NIC | Pollara 400 400 Gbps |
ConnectX-8 800 Gbps |
| NIC:GPU ratio | 1:1 (8 NICs/node) | 1:1 (8 NICs/node) |
| NIC host bus | PCIe Gen5 x16 | PCIe Gen5 x16 |
| MRC support | Yes P4 ASIC FW | Yes HW native |
| SRv6 uN SID | Yes | Yes |
| Packet spray | Yes (Pollara HW) | Yes (CX-8 HW) |
| Coll. lib. | RCCL + MRC shim | NCCL + MRC plugin |
| Fabric BW/node | 3.2 Tbps | 6.4 Tbps |
| Arista 7060XE7 | Compatible | Compatible |
| Next-gen NIC | Vulcano 800G MI400, PCIe Gen6 |
CX-9 (future) |
| Key diff | PCIe switch connects GPUs + NICs. No dedicated NVSwitch-equivalent. | NVSwitch dedicated silicon for GPU mesh. |
MRC is fabric-agnostic. Both implement OCP MRC Rev 1.0: 32-bit EV striped across UDP src port + IPv6 flow label; 128–256 EVs per QP split equally across planes; SRv6 uN SID static source routing; PFC disabled (lossy Ethernet); dynamic routing disabled. The Arista 7060XE7 sees identical traffic from both. Mixed AMD+Nvidia clusters on the same MRC fabric are architecturally valid.
MRC SRv6 fabric — AMD MI350X node → 8 planes → Arista 7060XE7
Pollara 400 — MRC capabilities
Intelligent packet spray — distributes packets across all 8 planes per NIC, 32-bit EV striped across UDP src port + IPv6 flow label · 128–256 EVs per QP, equally split across planes
In-order delivery — every packet carries RDMA virtual address + remote key → NIC writes directly to correct GPU memory location regardless of arrival order
Selective retransmission — SACK indicates precisely which packets arrived · selective retransmit only missing packets · packet trimming: payload stripped, header forwarded as loss signal
Path-aware congestion avoidance — ECN CE bit at leaf (not last hop) → CNP/NACK → Pollara swaps EV to cleaner plane · PFC disabled, runs lossy
Failover — packet loss (not trim) = path failure → EV retired immediately · background probes resurrect recovered paths
SW-programmable P4 ASIC — MRC transport runs in NIC firmware, updatable without hardware swap as spec evolves
SRv6 uN SID encoding — 32-bit EV maps algorithmically to SRv6 address template → specific switch path encoded in uN uSIDs · no dynamic routing in fabric
RCCL + MRC shim — OCP MRC Rev 1.0 ibverbs shim; RCCL workloads run over MRC with no source changes
Port config flexibility: Each Pollara 400 supports 1×400G, 2×200G, or 4×100G. In a multi-plane setup, a single NIC can cover 4 planes at 100G each, or 2 NICs cover all 8 planes at 100G — enabling MRC without a full 8-NIC configuration where cost is a concern.
Vulcano 800G (MI400 — upcoming)
PCIe Gen6 host interface — removes PCIe5 bandwidth bottleneck
800 Gbps per NIC — matches Nvidia CX-8
4×200G or 8×100G multi-plane configs
Up to 2.4 Tbps per GPU (3 NICs per GPU)
With 3× Vulcano NICs per MI400 GPU, AMD moves to a different plane-to-NIC mapping than Nvidia's 1:1. The Arista 7060XE7 fabric topology is unchanged — only the server-side cabling discipline differs.
8
Pollara NICs/node
3.2
Tbps fabric BW
8
MRC planes
400G
per uplink
P4
ASIC (SW prog.)
0
ECMP · no dyn. routing