Multiple chassis FAN/PSU reported as failed by a single node in shared chassis
Applies to
- ONTAP 9
- AFF-A300
- FAS8200
Issue
- Multiple PSU's and Chassis FANs were reported as failed via a single node, While the partner node sharing the chassis reported the FRU's to be healthy
- The event log lists the error as mentioned below:
Fri Aug 16 16:24:06 +0900 [Node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Ambient Temp 1) is not readable.
Fri Aug 16 16:24:06 +0900 [Node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Ambient Temp 2) is not readable.
Fri Aug 16 16:24:37 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan1 Speed is Unreadable
Fri Aug 16 16:24:37 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan1 Fault is Unreadable
Fri Aug 16 16:24:37 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan2 Speed is Unreadable
Fri Aug 16 16:24:37 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Fan2 Fault is Unreadable
Fri Aug 16 16:24:37 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Over Temp is Unreadable
Fri Aug 16 16:24:37 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Over Volt is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Over Curr is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Crest Factor is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 InPower Monitor is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Temperature is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Current is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Fan1 Speed is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Fan1 Fault is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Fan2 Speed is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Fan2 Fault is Unreadable
Fri Aug 16 16:24:38 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Over Temp is Unreadable
Fri Aug 16 16:24:39 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Over Volt is Unreadable
Fri Aug 16 16:24:39 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Over Curr is Unreadable
Fri Aug 16 16:24:39 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Crest Factor is Unreadable
Fri Aug 16 16:24:39 +0900 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 InPower Monitor is Unreadable
Fri Aug 16 16:24:44 +0900 [Node-01: power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU1, PSU2.
Fri Aug 16 16:24:44 +0900 [Node-01: power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU1, PSU2.
- SP/BMC is on the latest version
- Issue persists even after
- Reboot SP/BMC
- Takeover/Giveback -OR- Takeover, Motherboard reseat, Giveback the node reporting fault
