Node stuck in rebooting during upgrade due to failed BMC upgrade
Applies to
NetApp StorageGRID Appliances
Issue
When upgrading StorageGRID, one storage node gets stuck in rebooting stage.
When trying to access the BMC console, it keeps rebooting.
Node may also get stuck in PGE mode or continuesly reboot.
sg-fw-update.log
reports:[2025-01-15 18:10:00+00:00 SGA] Starting BMC firmware update. DO NOT REBOOT!
[2025-01-15 18:10:00+00:00 SGA] =============================================================================
[2025-01-15 18:10:00+00:00 SGA] Executing 'timeout 300 ./Yafuflash2 -cd -info rom.ima_enc'
[2025-01-15 18:10:00+00:00 SGA]
[2025-01-15 18:14:59+00:00 SGA] ERROR: failed BMC firmware update.
[2025-01-15 18:14:59+00:00 SGA] Executing 'timeout 120 ipmitool bmc reset cold'
[2025-01-15 18:15:00+00:00 SGA] Sent cold reset command to MC
[2025-01-15 18:15:00+00:00 SGA] Waiting for BMC to finish reset.