CFBMC-4168: SP heartbeat stopped and cannot be recovered leading to takeover and reboot
Issue
- ONTAP events :
[[spmgrd: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.]
[[spmgrd: callhome.sp.hbt.missed:notice]: Call home for SP HBT MISSED]
[[spmgrd: callhome.sp.hbt.stopped:alert]: Call home for SP HBT STOPPED]
[[env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes]
[[env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (System reboot to recover the BMC)]