Power Supply Status Critical due to faulty motherboard
Applies to
- FAS8200
- AFF-A300
Issue
- Takeover occurs due to no heartbeat was detected.
[cf.fsm.takeover.noHeartbeat:ALERT]: Failover monitor: Takeover initiated after no heartbeat was detected from the partner node.
[cf.fm.takeoverComplete:notice]: (EMS parameters: token="XXXXXXXXXXX_13:44:29_2024:12:14" partner_node_uuid="XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX")
- EMSshow PSU error and auto recover frequently.
[node1: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Temperature is Unreadable
[node1: power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU1.
[node1: power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU1.
[node1: monitor: monitor.globalStatus.critical:EMERGENCY]: Power Supply Status Critical: PSU1.
[node1: spsm_listener: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 20 seconds.
[node1: spsm_listener: sp.heartbeat.resumed:info]: Received IPMI heartbeat from the Service Processor (SP).
[node1: power_low_monitor: monitor.chassisPowerSupplies.ok:info]: Chassis power supplies OK.
[node1: monitor: monitor.globalStatus.ok:notice]: The system's global status is normal.
- 
    SP-LATEST-IPMIshow multiple onboard sensor innot_availablestatus.
Fan Override                            NORMAL
PSU1 Present                            PRESENT
PSU1 Temp                not_available     -- C         0 C         5 C        50 C        60 C
PSU1 Curr                not_available     -- mA       --          --          --          --
PSU1 Fan1 Speed          not_available     -- RPM    4500 RPM    4600 RPM      --          --
PSU1 Fan1 Fault          not_available      --
PSU1 Fan2 Speed          not_available     -- RPM    4500 RPM    4600 RPM      --          --
PSU1 Fan2 Fault          not_available      --
PSU1 Pwr In OK                              OK
PSU1 Pwr Out OK                             OK
PSU1 FAULT                                  OK
PSU1 Input Type          not_available      --
PSU1 Over Temp           not_available      --
PSU1 Over Volt           not_available      --
PSU1 Over Curr           not_available      --
PSU1 Crest Factor        not_available     --        1000          --        1728        2000
PSU1 InPwr Monitor       not_available     -- mW       --          --          --          --
PSU2 Present                            PRESENT
PSU2 Temp                not_available     -- C         0 C         5 C        50 C        60 C
PSU2 Curr                not_available     -- mA       --          --          --          --
PSU2 Fan1 Speed          not_available     -- RPM    4500 RPM    4600 RPM      --          --
PSU2 Fan1 Fault          not_available      --
PSU2 Fan2 Speed          not_available     -- RPM    4500 RPM    4600 RPM      --          --
PSU2 Fan2 Fault          not_available      --
PSU2 Pwr In OK                              OK
PSU2 Pwr Out OK                             OK
PSU2 FAULT                                  OK
PSU2 Input Type          not_available      --
PSU2 Over Temp           not_available      --
PSU2 Over Volt           not_available      --
PSU2 Over Curr           not_available      --
PSU2 Crest Factor        not_available     --        1000          --        1728        2000
PSU2 InPwr Monitor       not_available     -- mW       --          --          --          --
Bat Present                             PRESENT
- Partner node doesn't have same issue and report PSU status as normal.
