StorageGRID Appliance reboots unexpectedly due to Uncorrectable Machine Check Exception
Applies to
- NetApp StorageGRID Appliance SG6000
- NetApp StorageGRID Appliance SG6100
- NetApp StorageGRID Appliance SG1000
- NetApp StorageGRID Appliance SG1100
- NetApp StorageGRID Appliance SG100
- NetApp StorageGRID Appliance SG110
Issue
- StorageGRID reports
Unexpected node reboot - BMC reports Uncorrectable machine check exception
- Example:
19 | 12/21/2022 | 12:01:36 |Processor #0x74 | Uncorrectable machine check exception | Asserted
1b | 12/21/2022 | 12:01:42 | Temperature#0xaa | Upper Non-critical going high | Asserted
1c | 12/21/2022 | 12:01:42 | Temperature#0xaa | Upper Critical going high | Asserted
1e | 12/21/2022 | 12:01:47 | Processor #0x74| IERR | Asserted
1f | 12/21/2022 | 12:10:00 | Power Unit #0x77| Power off/down | Asserted
20 | 12/21/2022 | 12:10:08 | Power Unit #0x77| Power off/down | Deasserted
