BMC reboot due to HW watchdog timeout
Applies to
- FAS8300
- FAS8700
- AFF-A400
Issue
- BMC of the node reboots due to hardware watchdog timeout.
- SP-LATEST-STATISTICS
IPMI_Main.c main start 19.62 13.60
procmonitor.c IPMIMain crash 1 times 70.23 98.57
IPMI_Main.c main start 70.77 98.59
IPMI_Main.c main after sync time 73.63 98.70 Sat Aug 31 10:58:32 GMT 2024
BMC init unknown:Sat Aug 31 10:58:37 GMT 2024
GPIO boot : Primary
Physical slot : #1
Primary env : active:#1 inactive:#1
Last boot error : HW watchdog timeout happened last time!
- Events all
654 | 09/15/2024 | 18:43:43 | PEF #0xb5 | SNMP trap successfully sent | Asserted
655 | 09/15/2024 | 18:46:44 | PEF #0xb5 | SNMP trap successfully sent | Asserted
656 | 09/15/2024 | 18:49:42 | PEF #0xb5 | SNMP trap successfully sent | Asserted
657 | 09/15/2024 | 18:52:43 | PEF #0xb5 | SNMP trap successfully sent | Asserted
658 | 09/15/2024 | 18:56:39 | Power Unit #0xb3 | | Asserted | absent FRU: MEZZ_FRU
659 | 09/15/2024 | 18:56:39 | Entity Presence #0x9b | Absent | Asserted
65a | 09/15/2024 | 18:56:53 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
65b | 09/15/2024 | 18:58:47 | PEF #0xb5 | SNMP trap successfully sent | Asserted
65c | 09/15/2024 | 18:59:23 | PEF #0xb5 | SNMP trap successfully sent | Asserted
65d | 09/15/2024 | 18:59:58 | System Event #0xff | Timestamp Clock Sync | Asserted
65e | 09/15/2024 | 19:00:01 | System Event #0xff | Timestamp Clock Sync | Asserted
65f | 09/15/2024 | 19:00:25 | PEF #0xb5 | SNMP trap successfully sent | Asserted
660 | 09/15/2024 | 19:03:25 | PEF #0xb5 | SNMP trap successfully sent | Asserted