PPR memory test passed yet the LED fault light stays ON
Applies to
- AFF systems
- AFF A700
- AFF A700s
- AFF A800
- AFF A900
- AFF A400
- FAS systems
- FAS8300
- FAS8700
- FAS9000
- Post Package Repair (PPR) enabled
Issue
Node panicked due to an uncorrectable DIMM error:
PANIC: ECC error at DIMM-14: 2C-0F-1651-150A2A63,ADDR 0x7d28fefd80,(Node(0), Memory controller(0), CH(1), DIMM(0), Rank(1), Bank Group(2), Bank(0x0), in process vifmgr on release 9.7P8 (C) on Thu Dec 31 06:50:11 PST 2020 version: 9.7P8:
- The DIMM was tested by PPR during boot and the DIMM passed the test per BIOS updates for memory reliability and the PPR feature.
- However, upon boot up, ONTAP maintained the fault LED ON for the DIMM.
Verify LED
Note: For clusters with mixed models it is recommended to run the command for each node individually with the -node parameter.
::> system controller service-event show
Node ID Event Location Event Description
---------------- --- ---------------------------------- ---------------------
plata4-1a 1 DIMM in slot 1 on Controller A Uncorrectable ECC