"Management controller unavailable" SEL entry due to false positive "AC Lost" event
Applies to
- NetApp H610S storage node
- NetApp H615C compute node
- System Event Log (SEL)
Issue
An "AC Lost" SEL event is logged before the "Management controller unavailable" event with sensor number 0x7a, but the storage node did not reboot.
SEL Record ID : 00a7
Record Type : 02
Timestamp : 11/24/2022 11/24/2022
Generator ID : 0020
EvM Revision : 04
Sensor Type : Management Subsys Health
Sensor Number : 7a
Event Type : Sensor-specific Discrete
Event Direction : Assertion Event
Event Data : 0306ff
Description : Management controller unavailable
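A record in this "Field : Value" layout can be checked programmatically. The following is a minimal sketch, not a NetApp tool: the function names `parse_sel_record` and `decode_event_offset` are hypothetical, and the offset table is assumed to follow the generic IPMI v2.0 definitions for sensor type 28h (Management Subsystem Health). It shows that the low nibble of the first Event Data byte (0x03 in `0306ff`) corresponds to the "Management controller unavailable" description.

```python
# Offset names for IPMI sensor type 28h (Management Subsystem Health).
# Assumption: the BMC follows the generic IPMI v2.0 sensor-specific offsets.
MGMT_SUBSYS_HEALTH_OFFSETS = {
    0x00: "Sensor access degraded or unavailable",
    0x01: "Controller access degraded or unavailable",
    0x02: "Management controller off-line",
    0x03: "Management controller unavailable",
    0x04: "Sensor failure",
    0x05: "FRU failure",
}

# Subset of the record shown above, used only as sample input.
SAMPLE_RECORD = """
SEL Record ID : 00a7
Sensor Type : Management Subsys Health
Sensor Number : 7a
Event Direction : Assertion Event
Event Data : 0306ff
"""


def parse_sel_record(text):
    """Split each 'Field : Value' line into a dictionary entry."""
    record = {}
    for line in text.strip().splitlines():
        key, sep, value = line.partition(":")
        if sep:  # skip lines without a separator
            record[key.strip()] = value.strip()
    return record


def decode_event_offset(event_data_hex):
    """Return the sensor-specific offset: bits 3:0 of Event Data byte 1."""
    first_byte = int(event_data_hex[:2], 16)
    return first_byte & 0x0F


record = parse_sel_record(SAMPLE_RECORD)
offset = decode_event_offset(record["Event Data"])
print(record["Sensor Number"], MGMT_SUBSYS_HEALTH_OFFSETS.get(offset, "unknown"))
```

The sketch only interprets the first Event Data byte; the meaning of the remaining bytes (`06ff`) is OEM or sensor specific and is not decoded here.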
SEL entries as shown on H615C nodes:
Aug/12/2023 16:06:17 [Information] [Power Unit] [Power Unit] Power Off / Power Down - Deasserted
Aug/12/2023 16:06:12 [Warning] [BMC FW Health] [Management Subsystem Health] Management controller unavailable (BMC hardware watchdog timeout reset) - Asserted
Aug/12/2023 15:36:49 [Information] [Power Unit] [Power Unit] AC Lost - Asserted
Jul/8/2023 15:36:18 [Information] [Power Unit] [Power Unit] Power Off / Power Down - Deasserted
Jul/8/2023 15:36:13 [Warning] [BMC FW Health] [Management Subsystem Health] Management controller unavailable (BMC hardware watchdog timeout reset) - Asserted
Jul/8/2023 15:14:23 [Information] [Power Unit] [Power Unit] AC Lost - Asserted
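To spot this sequence in a longer SEL export, the entries can be scanned for an "AC Lost" assertion followed shortly by a "Management controller unavailable" assertion. The following is a minimal sketch under stated assumptions: the line layout is taken from the H615C entries above, the names `parse_line` and `ac_lost_before_bmc_reset` are hypothetical, and the one-hour window is an arbitrary illustrative value, not a documented threshold. It does not determine whether the node rebooted; that still has to be confirmed separately.

```python
import re
from datetime import datetime, timedelta

# Layout assumed from the entries above:
# <date> <time> [severity] [sensor name] [sensor type] <description>
LINE_RE = re.compile(r"^(\S+ \S+) \[(.*?)\] \[(.*?)\] \[(.*?)\] (.*)$")

SAMPLE_LINES = [
    "Aug/12/2023 16:06:17 [Information] [Power Unit] [Power Unit] Power Off / Power Down - Deasserted",
    "Aug/12/2023 16:06:12 [Warning] [BMC FW Health] [Management Subsystem Health] Management controller unavailable (BMC hardware watchdog timeout reset) - Asserted",
    "Aug/12/2023 15:36:49 [Information] [Power Unit] [Power Unit] AC Lost - Asserted",
    "Jul/8/2023 15:36:18 [Information] [Power Unit] [Power Unit] Power Off / Power Down - Deasserted",
    "Jul/8/2023 15:36:13 [Warning] [BMC FW Health] [Management Subsystem Health] Management controller unavailable (BMC hardware watchdog timeout reset) - Asserted",
    "Jul/8/2023 15:14:23 [Information] [Power Unit] [Power Unit] AC Lost - Asserted",
]


def parse_line(line):
    """Return (timestamp, severity, description), or None if the line does not match."""
    m = LINE_RE.match(line.strip())
    if not m:
        return None
    # %b expects English month abbreviations (C/English locale assumed).
    when = datetime.strptime(m.group(1), "%b/%d/%Y %H:%M:%S")
    return when, m.group(2), m.group(5)


def ac_lost_before_bmc_reset(lines, window=timedelta(hours=1)):
    """Pair each 'AC Lost' assertion with the next BMC-unavailable assertion within the window."""
    events = sorted(e for e in map(parse_line, lines) if e)  # oldest first
    pairs = []
    last_ac_lost = None
    for when, _severity, text in events:
        if text.startswith("AC Lost") and text.endswith("- Asserted"):
            last_ac_lost = when
        elif text.startswith("Management controller unavailable") and last_ac_lost:
            if when - last_ac_lost <= window:
                pairs.append((last_ac_lost, when))
            last_ac_lost = None
    return pairs


pairs = ac_lost_before_bmc_reset(SAMPLE_LINES)
for ac_lost, bmc_reset in pairs:
    print(ac_lost, "->", bmc_reset, "gap:", bmc_reset - ac_lost)
```

For the entries above, the sketch pairs each "AC Lost" entry with the BMC watchdog reset that follows it on the same day.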