Chassis PSU repeatedly reports fan RPM low and then back to normal values
Applies to
- AFF A800
Issue
- The following errors are continuously raised and cleared
[?] Fri Mar 18 05:16:55 +0200 [node1: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 FAN is Warning Low (1100 RPM)
[?] Fri Mar 18 05:17:00 +0200 [node1: power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU2.
[?] Fri Mar 18 05:17:00 +0200 [node1: power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU2.
[?] Fri Mar 18 05:17:00 +0200 [node1: monitor: monitor.globalStatus.critical:EMERGENCY]: Power Supply Status Critical: PSU2.
- PSU fan sensor is flapping between the Lower Non Critical and Lower Critical values of 1200 and 800 RPM respectively and then returning back above the threshold:
Record 146: Fri Mar 18 12:39:49.810000 2022 [IPMI.notice]: 0a31 | 02 | EVT: 01520708 | PSU2_FAN | Assertion Event, "Lower Critical going low " | Reading: 700.000 | Threshold: 800.000
Record 147: Fri Mar 18 12:40:07.900000 2022 [IPMI.notice]: 0a32 | 02 | EVT: 81520908 | PSU2_FAN | Deassertion Event, "Lower Critical going low " | Reading: 900.000 | Threshold: 800.000
Record 148: Fri Mar 18 12:40:09.910000 2022 [IPMI.notice]: 0a33 | 02 | EVT: 81500f0c | PSU2_FAN | Deassertion Event, "Lower Non-critical going low " | Reading: 1500.000 | Threshold: 1200.000
Record 149: Fri Mar 18 13:08:16.310000 2022 [IPMI.notice]: 0a34 | 02 | EVT: 0150080c | PSU2_FAN | Assertion Event, "Lower Non-critical going low " | Reading: 800.000 | Threshold: 1200.000
Record 150: Fri Mar 18 13:08:16.310000 2022 [IPMI.notice]: 0a35 | 02 | EVT: 01520808 | PSU2_FAN | Assertion Event, "Lower Critical going low " | Reading: 800.000 | Threshold: 800.000
Record 151: Fri Mar 18 13:08:18.330000 2022 [IPMI.notice]: 0a36 | 02 | EVT: 81520d08 | PSU2_FAN | Deassertion Event, "Lower Critical going low " | Reading: 1300.000 | Threshold: 800.000
Record 152: Fri Mar 18 13:08:18.330000 2022 [IPMI.notice]: 0a37 | 02 | EVT: 81500d0c | PSU2_FAN | Deassertion Event, "Lower Non-critical going low " | Reading: 1300.000 | Threshold: 1200.000
Record 153: Fri Mar 18 13:15:42.550000 2022 [IPMI.notice]: 0a38 | 02 | EVT: 01500c0c | PSU2_FAN | Assertion Event, "Lower Non-critical going low " | Reading: 1200.000 | Threshold: 1200.000
Record 154: Fri Mar 18 13:15:44.560000 2022 [IPMI.notice]: 0a39 | 02 | EVT: 81500d0c | PSU2_FAN | Deassertion Event, "Lower Non-critical going low " | Reading: 1300.000 | Threshold: 1200.000
Record 155: Fri Mar 18 13:20:54.100000 2022 [IPMI.notice]: 0a3a | 02 | EVT: 01500c0c | PSU2_FAN | Assertion Event, "Lower Non-critical going low " | Reading: 1200.000 | Threshold: 1200.000
Record 156: Fri Mar 18 13:20:58.120000 2022 [IPMI.notice]: 0a3b | 02 | EVT: 01520708 | PSU2_FAN | Assertion Event, "Lower Critical going low " | Reading: 700.000 | Threshold: 800.000
Record 157: Fri Mar 18 13:21:06.160000 2022 [IPMI.notice]: 0a3c | 02 | EVT: 81520c08 | PSU2_FAN | Deassertion Event, "Lower Critical going low " | Reading: 1200.000 | Threshold: 800.000
Record 158: Fri Mar 18 13:21:10.180000 2022 [IPMI.notice]: 0a3d | 02 | EVT: 8150100c | PSU2_FAN | Deassertion Event, "Lower Non-critical going low " | Reading: 1600.000 | Threshold: 1200.000
Record 159: Fri Mar 18 13:40:56.090000 2022 [IPMI.notice]: 0a3e | 02 | EVT: 01500a0c | PSU2_FAN | Assertion Event, "Lower Non-critical going low " | Reading: 1000.000 | Threshold: 1200.000
Record 160: Fri Mar 18 13:40:58.100000 2022 [IPMI.notice]: 0a3f | 02 | EVT: 01520808 | PSU2_FAN | Assertion Event, "Lower Critical going low " | Reading: 800.000 | Threshold: 800.000
Record 161: Fri Mar 18 13:41:00.110000 2022 [IPMI.notice]: 0a40 | 02 | EVT: 81520908 | PSU2_FAN | Deassertion Event, "Lower Critical going low " | Reading: 900.000 | Threshold: 800.000
Record 162: Fri Mar 18 13:41:02.120000 2022 [IPMI.notice]: 0a41 | 02 | EVT: 8150100c | PSU2_FAN | Deassertion Event, "Lower Non-critical going low " | Reading: 1600.000 | Threshold: 1200.000
Record 163: Fri Mar 18 13:48:32.200000 2022 [BMC CLI.notice]: guest "log in from 8.8.8.8"
Affected fan PSU is running at lower RPM than the rest of the fans:
PSU1_FAN | 8300.000 | RPM | ok | 800.000 | 1200.000 | na | na
PSU2_FAN | 3900.000 | RPM | ok | 800.000 | 1200.000 | na | na