Uncorrectable Machine Check Error panic cause by NVME boot Device
Applies to
- AFF-A800 / AFF-C800
- ONTAP 9
Issue
Panic string points to boot device during Uncorrectable Machine Check Error.
PANIC : Uncorrectable Machine Check Error at CPUx. SKL_IIO Error: STATUS<0xb380000000000e0b>(VALID,UC,EN,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))IIO Machine Check from device(s):Dv[2020](0,0,0): Link down, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controller, PCI Device 8086:a19c on Controller, Seagate M.2 NVMe SSD on Controller, PCI Device 8086:2020 on Controller, PCI Device 8086:2025 on Controlleversion: 9.10.1P1: Fri May 19 21:58:49 EDT 2023
conf : x86_64.optimize
Sysconfig-a:
slot 0: NVMe Boot Media #1 (0x144d,0xa804) Boot Media #1 NA03 GB 512B/sect (A1B2CD3EF45678)