FAS8300/FAS8700/C400 reboot due to Watchdog 2 Timer expired (OEM)
Applies to
- FAS8300
- FAS8700
- C400
Issue
- A node reboots unexpectedly because the Watchdog 2 timer expired
- The BMC log contains Watchdog 2 "Timer expired" events similar to the following
Example:
130 | 04/19/2022 | 00:00:16 | Watchdog 2 Watchdog2 | Timer expired (OEM) | Asserted
131 | 04/19/2022 | 00:02:26 | Watchdog 2 Watchdog2 | Timer expired (OEM) | Asserted
132 | 04/19/2022 | 00:02:37 | Power Unit Power | Power on | Asserted | from channel 1
133 | 04/19/2022 | 00:02:38 | PEF PEF | SNMP trap successfully sent | Asserted
134 | 04/19/2022 | 00:05:02 | Watchdog 2 Watchdog2 | Timer expired (OEM) | Asserted
135 | 04/19/2022 | 00:05:03 | Power Unit Power | Power on | Asserted | from channel 1
136 | 04/19/2022 | 00:05:04 | PEF PEF | SNMP trap successfully sent | Asserted
137 | 01/01/2000 | 00:01:34 | System Event | Timestamp Clock Sync | Asserted
138 | 01/01/2000 | 00:01:34 | System Event #0xff | Timestamp Clock Sync | Asserted
139 | 04/19/2022 | 00:07:26 | System Event #0xff | Timestamp Clock Sync | Asserted
13a | 04/19/2022 | 00:07:26 | System Event | Timestamp Clock Sync | Asserted
13b | 04/19/2022 | 00:07:26 | System Firmware Progress | Secondary CPU Initialization | Asserted
13c | 04/19/2022 | 00:07:26 | System Firmware Progress | USB resource configuration | Asserted
13d | 04/19/2022 | 00:07:31 | System Firmware Progress | PCI resource configuration | Asserted
13e | 04/19/2022 | 00:07:31 | System Firmware Progress | Video initialization | Asserted
13f | 04/19/2022 | 00:07:31 | System Firmware Progress | Keyboard controller initialization | Asserted
140 | 04/19/2022 | 00:07:31 | System Firmware Progress | Hard-disk initialization | Asserted
141 | 04/19/2022 | 00:07:39 | System Firmware Progress | Hard-disk initialization | Asserted
142 | 04/19/2022 | 00:07:41 | Power Unit Power | Power on | Asserted | from channel 1
143 | 04/19/2022 | 00:07:41 | PEF PEF | SNMP trap successfully sent | Asserted
144 | 04/19/2022 | 00:09:00 | Power Unit Power | Power on | Asserted | from channel 1
145 | 04/19/2022 | 00:09:00 | PEF PEF | SNMP trap successfully sent | Asserted
146 | 04/19/2022 | 00:09:57 | Power Unit Power | Power on | Asserted | from channel 1
147 | 04/19/2022 | 00:09:58 | PEF PEF | SNMP trap successfully sent | Asserted
148 | 04/19/2022 | 00:10:06 | Watchdog 2 Watchdog2 | Timer expired (BIOS FRB2) 0x00 | Asserted
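When a BMC log is long, it can help to pull out only the Watchdog 2 expirations and their timestamps. The following is a minimal sketch, assuming the log has been saved as text in the pipe-delimited layout shown above (record ID, date as MM/DD/YYYY, time, sensor, event, state). The names `SelEntry`, `parse_sel_line`, and `watchdog_expirations` are hypothetical helpers for illustration and are not part of any NetApp or BMC tool.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class SelEntry:
    record_id: str
    timestamp: datetime
    sensor: str
    event: str


def parse_sel_line(line):
    """Parse one pipe-delimited BMC log line; return None if it does not fit the layout."""
    fields = [f.strip() for f in line.split("|")]
    if len(fields) < 5:
        return None
    try:
        # Assumption: dates are MM/DD/YYYY, as in the excerpt above.
        ts = datetime.strptime(f"{fields[1]} {fields[2]}", "%m/%d/%Y %H:%M:%S")
    except ValueError:
        return None
    return SelEntry(fields[0], ts, fields[3], fields[4])


def watchdog_expirations(lines):
    """Return entries whose sensor is Watchdog 2 and whose event is a timer expiry."""
    entries = (parse_sel_line(line) for line in lines)
    return [
        e for e in entries
        if e is not None
        and e.sensor.startswith("Watchdog 2")
        and e.event.startswith("Timer expired")
    ]


# Four lines copied from the excerpt above.
sample = [
    "130 | 04/19/2022 | 00:00:16 | Watchdog 2 Watchdog2 | Timer expired (OEM) | Asserted",
    "131 | 04/19/2022 | 00:02:26 | Watchdog 2 Watchdog2 | Timer expired (OEM) | Asserted",
    "132 | 04/19/2022 | 00:02:37 | Power Unit Power | Power on | Asserted | from channel 1",
    "148 | 04/19/2022 | 00:10:06 | Watchdog 2 Watchdog2 | Timer expired (BIOS FRB2) 0x00 | Asserted",
]

hits = watchdog_expirations(sample)
for entry in hits:
    print(entry.record_id, entry.timestamp.isoformat(), entry.event)
```

The filter matches on the "Timer expired" prefix rather than the full event string, so both the (OEM) and (BIOS FRB2) variants seen in the excerpt are reported.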
- The console log shows the node restarting repeatedly
Example:
BIOS Version: 16.6
PEI start.
CPU PEI initialization.
Wait BMC self-test result.
BMC self-test: OK.
UPI initialization.
CPU initialization.
Running full memory initialization.
DIMM F0: NVDIMM with valid data and restage progressing.....
DIMM F0: NVDIMM RESTAGE Success+++++++++