PCI Error NMI panic on root port 0,1,0 with UCorrErr(ECRC) on FAS8200
Applies to
- FAS 8200
- AFF A300
- ONTAP 9
Issue
- Node panics with PCI Error NMI on root port 0,1,0:
PANIC: PCI Error NMI from device(s):ErrSrcID(CorrSrc(0),UCorrSrc(0x8)), RPT(0,1,0): in process idle: cpu11- The following additional errors may also be seen prior to the panic string:
0x000000cc539579ed: Add bad root port 0/1/0 (1)0x000000cc53ea7f3a: 0/1/0 ucerr_status 0x800000x000000cc53ea8423: Recovery process terminated PANIC : PCI Error NMI from device(s):ErrSrcID(CorrSrc(0),UCorrSrc(0x8)), RPT(0,1,0):- SSRAM/pelogs show the uncorrectable error is caused by End-to-end CRC (ECRC):
RZR,1_0.IIO0: GNERR<0x00000040>(P1A), GNFERR<0x00000040>(P1A); RPT(0,1,0): GLB<0x00000002>(NFERR), PTR<0x1>, Status(SigSysErr), DevStatus(NFatal), RootErr(UCor,NFatal), ErrSrcID(CorrSrc(0),UCorrSrc(0x8)), UCorrErr(ECRC), FirstUCorrErr(ECRC), TLPType(4MWrRq)- All boot attempts fail with the same panic.
