Node reboots with NVDIMM in ERROR condition
Applies to
- AFF A800, AFF C800, ASA A800, ASA C800
- AFF A400, AFF C400, ASA A400, ASA C400
- FAS8300, FAS8700
- ONTAP 9
Issue
- Node goes down and the below errors are observed in the console logs:
CPU initialization.
Running full memory initialization.
DIMM F0: NVDIMM in ERROR condition (status = 00000801).
[nvdimm.nvmem.destage.failure:ALERT]:NVMEM subsystem fail to start or complete destage process, please check battery or NVDIMM..
- Uncorrectable ECC error may be reported in the BMC event logs:
c17 | 05/02/2024 | 05:55:46 | Watchdog 2 #0xb1 | Timer interrupt (NMI/SMS/OS) | Asserted
c18 | 05/02/2024 | 05:55:46 | Critical Interrupt #0xb0 | NMI/Diag Interrupt | Asserted
c19 | 05/02/2024 | 05:55:49 | Watchdog 2 #0xb1 | Hard reset (SMS/OS) | Asserted
c1a | 05/02/2024 | 05:55:49 | Power Unit #0xb2 | Power reset | Asserted | from channel 15
c23 | 05/02/2024 | 05:58:49 | Memory #0x08 | Uncorrectable ECC | Asserted