ONTAP cluster node reboots unexpectedly
Applies to
- ONTAP 9
- FAS 8700
Issue
- The node reboots unexpectedly with the panic string below:
Example:
Panic String: Uncorrectable Machine Check Error at CPU25. SKL_IIO Error: STATUS<0xbb80000000000e0b>(VALID,UC,EN,MISCV,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))MISC<0x0000000080000000>(UCR_BUS_LOG(128),UCR_DEVICE_LOG(0),UCR_FUNCTION_LOG(0),UCR_SEGMENT_LOG(0))IIO Machine Check from device(s):RPT(128,0,0):ErrSrcID(CorrSrc(0x8100),UCorrSrc(0x8250)), PLX PCIE 8733 switch on Controller, Br[8733](130,10,0): Link down. in process idle: cpu25 on release 9.13.1P8
- After the giveback operation, the Flash Cache card in slot 6 is reported as missing in the sysconfig -a output and marked as unsupported in the sysconfig -ac output.
Example:
::> node run -node <node_name> sysconfig -ac
sysconfig: Card in slot 6 (15-144D-A808-0) is not supported.
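- The sysconfig -P command displays the PCI hierarchy of all PCI devices in the controller. In the example output, the slot 6 device is listed directly after Br[8733](130,10,0), the bridge named in the panic string.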
::> node run -node <node_name> sysconfig -P
Br[2030](128,0,0): PCI Device 8086:2030 on Controller
LinkCap(MaxLkSp(3),MaxLkWd(4),ASPM(2),L0(3),L1(4),SurpDn,DLAct,Port(0))
LinkStatus(LkSp(3),LkWd(4),SClk,DLAct),
Br[8733](129,0,0): PLX PCIE 8733 switch on Controller
LinkCap(MaxLkSp(3),MaxLkWd(4),ASPM(2),L0(6),L1(2),Port(8))
LinkStatus(LkSp(3),LkWd(4)),
Br[8733](130,10,0): PLX PCIE 8733 switch on Controller
LinkCap(MaxLkSp(3),MaxLkWd(4),ASPM(2),L0(6),L1(2),SurpDn,DLAct,Port(10))
LinkStatus(LkSp(3),LkWd(4),DLAct),
Dv[a808](131,0,0) in slot 6: PCI Device 144d:a808 in slot 6 on Controller
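
The correlation between the panic string and the sysconfig -P output can be sketched in a short script. This is a minimal illustration, not a NetApp tool: the function names link_down_bridge and slot_below are hypothetical, the three numbers in parentheses are assumed to be bus, device and function, and the script assumes that sysconfig -P lists each device directly after the bridge it sits behind. The sample text is copied from the examples in this article.

```python
import re

# Excerpts copied from the example output in this article.
PANIC = ("IIO Machine Check from device(s):RPT(128,0,0):"
         "ErrSrcID(CorrSrc(0x8100),UCorrSrc(0x8250)), "
         "PLX PCIE 8733 switch on Controller, Br[8733](130,10,0): Link down.")

SYSCONFIG_P = """\
Br[2030](128,0,0): PCI Device 8086:2030 on Controller
LinkCap(MaxLkSp(3),MaxLkWd(4),ASPM(2),L0(3),L1(4),SurpDn,DLAct,Port(0))
Br[8733](129,0,0): PLX PCIE 8733 switch on Controller
Br[8733](130,10,0): PLX PCIE 8733 switch on Controller
LinkStatus(LkSp(3),LkWd(4),DLAct),
Dv[a808](131,0,0) in slot 6: PCI Device 144d:a808 in slot 6 on Controller
"""

# Matches the bridge that the panic string reports as "Link down".
LINK_DOWN_RE = re.compile(r"Br\[\w+\]\((\d+),(\d+),(\d+)\): Link down")

# Matches a bridge (Br) or device (Dv) entry; the slot number is optional.
ENTRY_RE = re.compile(
    r"^\s*(Br|Dv)\[(\w+)\]\((\d+),(\d+),(\d+)\)(?: in slot (\d+))?:"
)


def link_down_bridge(panic):
    """Return the (bus, device, function) tuple of the bridge whose link
    went down, or None if the panic string has no such entry."""
    match = LINK_DOWN_RE.search(panic)
    return tuple(int(g) for g in match.groups()) if match else None


def slot_below(hierarchy, bridge):
    """Return the slot number of the first device listed directly after
    the given bridge, or None if no such device is found.

    Assumption: entries appear in hierarchy order, so a Dv line that
    follows a Br line (with no other Br line in between) is behind it.
    """
    after_target = False
    for line in hierarchy.splitlines():
        match = ENTRY_RE.match(line)
        if not match:
            continue  # LinkCap/LinkStatus detail lines are skipped
        kind, _dev_id, bus, dev, fn, slot = match.groups()
        if kind == "Br":
            after_target = (int(bus), int(dev), int(fn)) == bridge
        elif after_target:
            return int(slot) if slot else None
    return None


bridge = link_down_bridge(PANIC)
print("Link-down bridge:", bridge)
print("Slot behind that bridge:", slot_below(SYSCONFIG_P, bridge))
```

With the sample data, the script points from the link-down bridge to the card in slot 6, which matches the slot that sysconfig -ac reports as unsupported.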