HCI compute host crashed with uncorrectable memory error
Applies to
- NetApp HCI Compute nodes
- VMware ESXi
Issue
- The ESXi host experiences a PSOD (purple diagnostic screen) reporting an uncorrectable memory error for a DIMM.
- Full PSOD examples are in Additional Information.
- Possible BMC System Event Logs (SEL):
  BIOS OEM(Memory Error) (runtime) Failing DIMM: DIMM location. (PX-DIMMXX) - Assertion
  BIOS OEM(Memory Error) Failing DIMM: DIMM location (Uncorrectable Memory Component Found) (PX-DIMMXX) - Assertion
  Memory(OEM) Uncorrectable ECC / other uncorrectable memory error @PX-DIMMXX(CPUX) - Assertion
  [Memory Error] [Memory] Uncorrectable ECC(CPUX_BX) - Asserted
  BIOS OEM(Memory Error) Post package repair fail. (PX-DIMMXX) - Assertion
  BIOS OEM(Memory Error) Memory signal is too marginal. (PX-DIMMXX) - Assertion
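The SEL messages listed above can be searched for in a plain-text export of the event log. The following is a minimal sketch, not a NetApp tool: the function name `find_memory_events` is hypothetical, the input is assumed to be SEL text with one event per line, and the DIMM location pattern assumes that real locations look like `P1-DIMMA1` (the article only shows the placeholder `PX-DIMMXX`).

```python
import re

# Phrases taken from the SEL messages listed in this article.
MEMORY_ERROR_PHRASES = (
    "Failing DIMM",
    "Uncorrectable Memory Component Found",
    "Uncorrectable ECC",
    "other uncorrectable memory error",
    "Post package repair fail",
    "Memory signal is too marginal",
)

# ASSUMPTION: real DIMM locations look like "P1-DIMMA1".
# The article only shows the placeholder "PX-DIMMXX".
DIMM_LOCATION = re.compile(r"P\d+-DIMM[A-Z]\d+")


def find_memory_events(sel_text):
    """Return (dimm_location_or_None, line) for each matching SEL line."""
    events = []
    for line in sel_text.splitlines():
        lowered = line.lower()
        if any(phrase.lower() in lowered for phrase in MEMORY_ERROR_PHRASES):
            match = DIMM_LOCATION.search(line)
            events.append((match.group(0) if match else None, line.strip()))
    return events
```

Lines that match a phrase but carry no recognizable DIMM location (for example, events that name only a CPU and bank) are returned with `None` as the location, so they are not silently dropped.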
Example: from BMC SEL

