Node reports CriticalPSUFruOffAlert in system health alerts due to failed PSU
Applies to
- ONTAP 9
- FAS/AFF Systems
Issue
- Node reports
CriticalPSUFruOffAlert
in health alerts:
ClusterA::> system health alert show -instance
Node: Node-01
Monitor: chassis
Alert ID: CriticalPSUFruOffAlert
Alerting Resource: xxxxxxx
Subsystem: Environment
Indication Time: Fri May 20 02:08:21 2022
Severity: Critical
Probable Cause: Loss_of_redundancy
Description: PSU1 is off. The nodes in this chassis are Node-01, Node-02.
Corrective Actions: 1. Check PSU1 and switch it on.
2. Refer to the Hardware specification guide for more information on the position of the power supply unit (PSU) and ways to check or replace it.
3. Contact support personnel if the alert persists.
- The following events are reported in the event logs:
[Node-01: env_mgr: monitor.chassisPowerSupply.off:notice]: Chassis power supply 1 off.
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Temperature is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Current is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan1 Speed is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan1 Fault is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan2 Speed is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan2 Fault is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Pwr Out OK is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fault is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Over Temp is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Over Volt is Unreadable
[Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Over Curr is Unreadable
[Node-01: power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU1.
[Node-01: power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU1.
- SP reports the PSU sensors as
na
insystem sensors
output:
SP Node-01> system sensors
Sensor Name | Current | Unit | Status | LCR | LNC | UNC | UCR
-----------------+------------+------------+------------+-----------+-----------+-----------+-----------
PSU1_Present | 0x0 | discrete | Present | na | na | na | na
PSU1_Temp | na | degrees C | na | 0.000 | 5.000 | 50.000 | 60.000
PSU1_Curr | na | Amps | na | na | na | na | na
PSU1_Fan1_Speed | na | RPM | na | 4500.000 | 4600.000 | na | na
PSU1_Fan1_Fault | na | discrete | na | na | na | na | na
PSU1_Fan2_Speed | na | RPM | na | 4500.000 | 4600.000 | na | na
PSU1_Fan2_Fault | na | discrete | na | na | na | na | na
PSU1_Status_OK | na | discrete | na | na | na | na | na
PSU1_Pwr_In_OK | 0x0 | discrete | Deasserted | na | na | na | na
PSU1_Pwr_Out_OK | na | discrete | na | na | na | na | na
PSU1_Fault | na | discrete | na | na | na | na | na
PSU1_Input_Type | na | discrete | na | na | na | na | na
PSU1_Over_Temp | na | discrete | na | na | na | na | na
PSU1_Over_Volt | na | discrete | na | na | na | na | na
PSU1_Over_Curr | na | discrete | na | na | na | na | na
- The issue persists even after reseating the affected PSU.