ONTAP system health and environment reporting degraded, not-available and failed PSU
Applies to
- AFF/FAS platforms
- Power Supplies
Issue
- ONTAP CLI reports:
::> system health subsystem show
Environment degraded
::> system node environment sensors show
Node Sensor State Value/Units Crit-Low Warn-Low Warn-Hi Crit-Hi
PSU1 VIN not-available - mV 90480 93600 261040 263120
PSU1 VOUT not-available - mV 11336 11440 12948 13156
PSU1 Curr IIN not-available - mA 0 - 9984 12012
PSU1 IOUT not-available - mA 0 - 130000 132000
PSU1 PIN not-available - W 7 14 1611 1796
PSU1 POUT not-available - W 7 14 1611 1796
PSU1 Inlet failed - C 0 - 58 63
PSU1 Hot failed - C 0 - 100 105
PSU1 FB Hot failed - C 0 - 100 105
PSU1 FAN not-available - RPM 800 1200 - -
- The EMS logs continuously report following events:
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 VIN is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 VOUT is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Curr IIN is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 IOUT is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 PIN is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 POUT is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Inlet is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Hot is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 FB Hot is Unreadable
Thu Sep 10 18:31:27 +0200 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 FAN is Unreadable
Thu Sep 10 18:35:56 +0200 [Node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU1 Inlet) is not readable.
Thu Sep 10 18:35:56 +0200 [Node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU1 Hot) is not readable.
Thu Sep 10 18:35:56 +0200 [Node-01: env_mgr: monitor.temp.unreadable:error]: The controller temperature (PSU1 FB Hot) is not readable.
or
hu Mar 06 07:45:07 +0100 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 In Volt is Unreadable
Thu Mar 06 07:45:17 +0100 [Node-01: kernel: csm.createSessionFailed:debug]: Cluster Session Manager (CSM) failed to create session (req=9d9cf0af-0eb9-11eb-9c62-000d3abfd6ad, rsp=localhost:dblade, uniquifier=0e062fa6d8573e93) with transport type CT, session tag CPEER, record state STARTING, CSM error CSM_CONNABORTED, low-level error , socket error 0, and TLS error 2113929222.
Thu Mar 06 07:45:17 +0100 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 In Curr is Unreadable
Thu Mar 06 07:45:17 +0100 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 12V Out Volt is Unreadable
Thu Mar 06 07:45:17 +0100 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 12V Out Curr is Unreadable
Thu Mar 06 07:45:28 +0100 [Node-01: env_mgr: monitor.chassisPowerSupply.off:notice]: Chassis power supply 2 off.
Thu Mar 06 07:45:28 +0100 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 In Pwr is Unreadable
Thu Mar 06 07:45:28 +0100 [Node-01: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 2 is degraded: PSU2 Out Pwr is Unreadable