NetApp HCI - Machine Check Error on H610C/H615C node
Applies to
NetApp H610C / H615C (Compute Node)
Issue
- A node crashed with PSOD
- Dozens of
Correctable ECC
errors leading to:
[Critical] [Memory Error] [Memory] Correctable ECC Error Logging Limit Reached(CPU0_D1)
- IPMI SEL alerts:
[Warning] [Additional MCE Error] [OEM Record C2] ManufacturerID:001C4C, Extra Information : 0 MSCOD:0010 MCACOD:0134
[Critical] [MCERR] [Processor] Uncorrectable Error - Machine Check Error: Bank 1/CPU 1/Core 10 - Asserted