CHW-1103: PSU degraded alert due to sensors unreadable
Issue
AFF-A250 systems might report CHASSIS POWER SUPPLY DEGRADED and CHASSIS OVER
TEMPERATURE alerts, although partner nodes report PSUs as normal.
This occurs even when the EMS log shows that the PSU sensors are unreadable.
EMS log:
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Inlet is Unreadable
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Hot is Unreadable
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 FB Hot is Unreadable
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan is Unreadable
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Inlet is Unreadable
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Hot is Unreadable
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 FB Hot is Unreadable
env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Fan is Unreadable
power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU2, PSU1.
power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU2, PSU1.
env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU1 Inlet) is not readable.
env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU1 Hot) is not readable.
env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU1 FB Hot) is not readable.
env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU2 Inlet) is not readable.
env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU2 Hot) is not readable.
env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU2 FB Hot) is not readable.
env_mgr: callhome.chassis.hitemp:error]: Call home for CHASSIS OVER TEMPERATURE
PLATFORM-SENSORS.XML:
PSU1 fru failed failed
PSU2 fru failed failed
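
To check whether a saved EMS log excerpt matches the pattern above, the "is Unreadable" messages can be grouped by power supply. The following is a minimal sketch in Python, not a NetApp tool. The function name unreadable_sensors is hypothetical, and the regular expression assumes the message text appears exactly as in the excerpt above.

```python
import re
from collections import defaultdict

# Assumption: message text matches the EMS excerpt in this article.
UNREADABLE = re.compile(
    r"monitor\.chassisPowerSupply\.degraded:\w+\]: "
    r"Chassis power supply (\d+) is degraded: (PSU\d+ .+?) is Unreadable"
)


def unreadable_sensors(ems_lines):
    """Return {psu_number: sorted sensor names reported as Unreadable}.

    Hypothetical helper for illustration only.
    """
    found = defaultdict(set)
    for line in ems_lines:
        match = UNREADABLE.search(line)
        if match:
            found[int(match.group(1))].add(match.group(2))
    return {psu: sorted(names) for psu, names in found.items()}


# Sample lines copied from the EMS excerpt; the third is a different
# event type and is intentionally ignored by the pattern.
sample = [
    "env_mgr: monitor.chassisPowerSupply.degraded:notice]: "
    "Chassis power supply 1 is degraded: PSU1 Inlet is Unreadable",
    "env_mgr: monitor.chassisPowerSupply.degraded:notice]: "
    "Chassis power supply 2 is degraded: PSU2 Fan is Unreadable",
    "env_mgr: monitor.temp.unreadable:error]: "
    "The controller temperature (PSU1 Hot) is not readable.",
]

for psu, sensors in sorted(unreadable_sensors(sample).items()):
    print(f"PSU{psu}: {', '.join(sensors)}")
```

If every sensor on both power supplies appears in the result, the log matches the symptom described in this article. The sketch only summarizes log text and does not determine the cause.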