AFF-C80 Controller Down Due to BMC Unresponsiveness – Emergency Shutdown and Recovery
Applies to
- AFF-C80
- ONTAP 9
- BMC firmware versions 18.4P1
Issue
AFF-C80 cluster became unresponsive and could not boot. Attempts to access the BMC via serial port returned no output and the following EMS log entries were observed:
nodename EMERGENCY monitor.shutdown.emergency: Emergency shutdown: EnvironmentalReasonShutdown (System reboot to recover the SP)nodename ALERT callhome.sp.hbt.stopped: Call home for SP HBT STOPPEDnodename ALERT cf.hwassist.notifyCfgFailed: Failed to update the hardware-assist configuration with hardware component (BMC):HW Assist not configured (4).nodename INFORMATIONAL sp.heartbeat.stopped: Have not received an IPMI heartbeat from the Service Processor (SP) in last 600 seconds.Cluster output showed nodename as “Unknown” and down: