PPR memory test passed yet the LED fault light stays ON for an AFF A700s
Applies to
- AFF A700s
- ONTAP 9
- Post Package Repair (PPR)
Issue
Node panicked due to an uncorrectable DIMM error:
PANIC: ECC error at DIMM-14: 2C-0F-1651-150A2A63,ADDR 0x7d28fefd80,(Node(0), Memory controller(0), CH(1), DIMM(0), Rank(1), Bank Group(2), Bank(0x0), in process vifmgr on release 9.7P8 (C) on Thu Dec 31 06:50:11 PST 2020 version: 9.7P8:
- The DIMM was tested by PPR during boot and the DIMM passed the test per BIOS updates for memory reliability and the PPR feature.
- However, upon boot up, ONTAP maintained the fault LED ON for the DIMM.
Verify LED
Note: For clusters with mixed models it is recommended to run the command for each node individually with the -node parameter.
::> set advanced
::*> system controller fru led show
Node FRU Type Bay Slot State Lit By
--------------------- ----------- --- ---- ------- -------
node1
controller A 1 on SP
ioxm B 1 off -
dimm - 2 off -
dimm - 3 on SP