NetApp Knowledge Base

Handling L2 Watchdog Resets on the AFF/ASA A800, C800 Platform

Category:
aff-series
Specialty:
hw

Applies to

  • AFF A800, AFF C800, ASA A800, ASA C800

Issue

  • Node reboots unexpectedly
  • Node does not reboot after an unexpected shutdown

BMC logs on the affected node show one of the following event sequences:

[ASUP.notice]: First notification email | (REBOOT (abnormal)) WARNING | Send failed
[IPMI.notice]: 0076 | 02 | EVT: 6fc302ff | System_Watchdog | Assertion Event, "Power cycle"
[IPMI Event.critical]: L2 watchdog timeout power cycle
Or
[IPMI.notice]: 0e1a | 02 | EVT: 6fc824ff | System_Watchdog | Assertion Event, "Timer interrupt"
[IPMI Event.critical]: NMI
[IPMI.notice]: 0e1b | 02 | EVT: 6f00ffff | CriticalInt | Assertion Event, "NMI/Diag Interrupt"
[IPMI.notice]: 0e1c | 02 | EVT: 6fc124ff | System_Watchdog | Assertion Event, "Hard reset"
[IPMI Event.critical]: L2 watchdog timeout hard reset
[IPMI Event.critical]: System reset
[IPMI Event.critical]: L2 watchdog action completed
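
The signature above can be checked for mechanically once the BMC log has been saved as text. The following is a minimal sketch, not a NetApp tool: the function name `find_l2_watchdog_events` and the regex group names (`record`, `field2`, `evt`) are hypothetical labels chosen for illustration, and the pattern assumes the lines look exactly like the excerpts shown here.

```python
import re

# Matches the [IPMI.notice] lines shown in the excerpt. The group names are
# illustrative labels only; they are not official field names.
IPMI_LINE = re.compile(
    r'\[IPMI\.notice\]:\s*(?P<record>[0-9a-fA-F]+)\s*\|\s*(?P<field2>[0-9a-fA-F]+)\s*\|'
    r'\s*EVT:\s*(?P<evt>[0-9a-fA-F]+)\s*\|\s*(?P<sensor>\S+)\s*\|'
    r'\s*(?P<kind>[^,]+),\s*"(?P<desc>[^"]*)"'
)

CRITICAL_PREFIX = "[IPMI Event.critical]:"


def find_l2_watchdog_events(log_text):
    """Return (critical_messages, watchdog_events) found in log_text.

    critical_messages: text of each critical line that mentions "L2 watchdog".
    watchdog_events:   (event data, description) for each System_Watchdog line.
    """
    critical = []
    events = []
    for raw in log_text.splitlines():
        line = raw.strip()
        if line.startswith(CRITICAL_PREFIX) and "L2 watchdog" in line:
            critical.append(line[len(CRITICAL_PREFIX):].strip())
        match = IPMI_LINE.search(line)
        if match and match.group("sensor") == "System_Watchdog":
            events.append((match.group("evt"), match.group("desc")))
    return critical, events
```

A non-empty `critical_messages` list indicates that one of the two L2 watchdog sequences above is present; the `watchdog_events` list shows whether the action was a power cycle or a timer interrupt followed by a hard reset.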

  • If the node reboots, the following errors appear in the EMS log:

Thu May 05 15:33:43 +0800 [netapp: splog_main: mgr.boot.reason_abnormal:EMERGENCY]: System rebooted due to a watchdog reset.
Thu May 05 15:33:43 +0800 [netapp: splog_main: callhome.reboot.watchdog:alert]: Call home for REBOOT (watchdog reset)

Thu May 05 15:33:43 +0800 [netapp: cf_hwassist: cf.hwassist.takeoverTrapRecv:notice]: hw_assist: Received takeover hw_assist alert from partner(node), system_down because l2_watchdog_reset.
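
The EMS entries above can be filtered the same way from a saved copy of the EMS log. This is a sketch under assumptions: the helper name `watchdog_ems_events` is hypothetical, the line layout is inferred only from the three entries shown, and the event list contains only the event names that appear in this article.

```python
import re

# Layout inferred from the entries above:
# <timestamp> [<node>: <process>: <event name>:<severity>]: <message>
EMS_LINE = re.compile(
    r'\[(?P<node>[^:\]]+):\s*(?P<process>[^:\]]+):\s*'
    r'(?P<event>[\w.]+):(?P<severity>\w+)\]:\s*(?P<message>.*)'
)

# Event names taken from the excerpt; not an exhaustive list.
WATCHDOG_EVENTS = {
    "mgr.boot.reason_abnormal",
    "callhome.reboot.watchdog",
    "cf.hwassist.takeoverTrapRecv",
}


def watchdog_ems_events(ems_text):
    """Return (event name, severity) for each EMS line tied to a watchdog reset."""
    hits = []
    for line in ems_text.splitlines():
        match = EMS_LINE.search(line)
        if not match:
            continue
        # These events can be raised for other reasons as well, so also
        # require the word "watchdog" in the message text.
        is_known = match.group("event") in WATCHDOG_EVENTS
        if is_known and "watchdog" in match.group("message").lower():
            hits.append((match.group("event"), match.group("severity")))
    return hits
```

Checking the message text as well as the event name keeps abnormal reboots with a different cause from being reported as watchdog resets.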

  • If the node is unable to reboot, the system sensors on the BMC may show Attn_Sensor1 as Asserted:

PCI_SW1_Err      | 0x0        | discrete   | Deasserted | na        | na        | na        | na        
Wrench_Port_Up   | 0x0        | discrete   | Enabled    | na        | na        | na        | na        
SysReset         | 0x0        | discrete   |            | na        | na        | na        | na        
System_Watchdog  | 0x0        | discrete   |            | na        | na        | na        | na        
Attn_Sensor1     | 0x0        | discrete   | Asserted   | na        | na        | na        | na 
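
The pipe-delimited sensor listing above is easy to parse when captured as text. The sketch below assumes the first four columns are sensor name, reading, type, and state; those column labels and the function name `parse_sensor_rows` are assumptions made for illustration, since the excerpt has no header row.

```python
def parse_sensor_rows(sensor_text):
    """Map each sensor name to its reading, type, and state columns.

    The column meanings are assumed from the layout of the excerpt.
    """
    sensors = {}
    for line in sensor_text.splitlines():
        cols = [col.strip() for col in line.split("|")]
        # Skip blank lines and anything without at least four columns.
        if len(cols) < 4 or not cols[0]:
            continue
        sensors[cols[0]] = {
            "reading": cols[1],
            "type": cols[2],
            "state": cols[3],
        }
    return sensors


def attention_asserted(sensor_text, name="Attn_Sensor1"):
    """Return True if the named sensor is present and its state is Asserted."""
    row = parse_sensor_rows(sensor_text).get(name)
    return row is not None and row["state"] == "Asserted"
```

Applied to the listing above, `attention_asserted` returns True because the Attn_Sensor1 row reports Asserted in its state column.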
