Service-Processor degraded caused by stuck/stale SPAutoUpgradeFailedMajorAlert
Applies to
Issue
- SP auto-upgrade fails and node reports:
[node1: nphmd: hm.alert.raised:alert]: Alert Id = SPAutoUpgradeFailedMajorAlert , Alerting Resource = SP Upgrade raised by monitor controller
- SP subsystem status gets degraded:
cluster1::> system health subsystem show
Subsystem Health
----------------- ------------------
SAS-connect ok
Environment ok
Memory ok
Service-Processor degraded
- SP firmware upgrade on the nodes is successful
- Accessing the SP via SSH works fine