CFBMC-4168: SP heartbeat stopped and cannot be recovered leading to takeover and reboot
Issue
- ONTAP events :
[[spmgrd: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.] [[spmgrd: callhome.sp.hbt.missed:notice]: Call home for SP HBT MISSED] [[spmgrd: callhome.sp.hbt.stopped:alert]: Call home for SP HBT STOPPED] [[env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes] [[env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (System reboot to recover the BMC)]