CFBMC-8474: BMC hang in "pilot4_lowlevel_init" during u-boot initialization causes BMC unresponsiveness
Issue
- On AFF A250 systems with BMC firmware version 15.13, the BMC may become unresponsive during the u-boot initialization process.
- The hang has been observed in the
pilot4_lowlevel_initfunction, and in some cases in theBoot_To_BackupSPIfunction.
- When this occurs, ONTAP may detect the loss of BMC heartbeat and initiate a system shutdown or emergency recovery to prevent hardware damage or data loss.
The BMC typically recovers after a Power On Reset (POR).
Example events indicating the issue include:
[node-01: spmgrd: callhome.sp.hbt.stopped:alert]: Call home for SP HBT STOPPED
[node-01: env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes.