Skip to main content
NetApp Knowledge Base

System shutdown due to SP HBT STOPPED with KCS errors on boot

Views:
1,426
Visibility:
Public
Votes:
0
Category:
aff-series
Specialty:
hw
Last Updated:

Applies to

  • AFF A250, AFF C250
  • ASA A250, ASA C250
  • FAS500f

Issue

  • Node shuts down due to SP HBT STOPPED:
Sat Aug 19 03:46:24 -0400 [cluster-01: spmgrd: sp.heartbeat.stopped:debug]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.
Sat Aug 19 03:46:24 -0400 [cluster-01: spmgrd: callhome.sp.hbt.missed:debug]: Call home for SP HBT MISSED
Sat Aug 19 03:56:44 -0400 [cluster-01: spmgrd: callhome.sp.hbt.stopped:debug]: Call home for SP HBT STOPPED
Sat Aug 19 03:59:08 -0400 [cluster-01: env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes.
Sat Aug 19 04:09:08 -0400 [cluster-01: env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (System reboot to recover the BMC)
  • Partner takes over when the partner reboots:
Sat Aug 19 04:09:33 -0400 [cluster-02: cf_main: cf.fsm.takeover.on.reboot:debug]: Failover monitor: One node initiated automatic takeover after detecting that its partner node is rebooting.
  • Serial console indicates node is at LOADER. When toggling to BMC CLI (Ctrl-G) the console shows output similar to the following:
sh: can't create /sys/module/watchdog_hw/parameters/current_wdt_device: nonexistent directory
sh: can't create /sys/module/watchdog_hw/parameters/current_wdt_device: nonexistent directory
 
KCS cmd(NETFN 0x6, CMD 0x1) failed, ret -2
  • Power cycle of the node has no effect.
    • The node still does not boot and the BMC is still unresponsive.
    • Attempting to boot_ontap from LOADER results in the following on boot:
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
Could not patch the required SMBIOS 1 field 1 with the FRU data.
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
KCS cmd(NETFN 0xa, CMD 0x10) failed, ret -2
Copyright(c) 2021 American Megatrends, Inc. 
��Copyright(c) 2021 American Megatrends, Inc. 
��ERROR: Class:0; Subclass:20000; Operation: 1002
 
Boot Loader version 6.5.8 
Copyright (C) 2000-2003 Broadcom Corporation.
Portions Copyright (C) 2002-2023 NetApp, Inc. All Rights Reserved.
 
KCS cmd(NETFN 0x6, CMD 0x1) failed, ret -2
Resetting BMC from backup FW...
Waiting 30 seconds for BMC to reboot...
KCS cmd(NETFN 0x6, CMD 0x1) failed, ret -2
Copyright(c) 2021 American Megatrends, Inc. 
��ERROR: Class:0; Subclass:20000; Operation: 1002

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.