Just one node reports increased temperature and the failure of both power supplies
Applies to
Issue
- Just one AFF A250 node reports high chassis temperature and two power supplies failures. Example:
::>event log show -event monitor*,chassis*
Severity Event
--------- ----------------------------------------------
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 In Fault is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Fan is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 FB Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Inlet is Unreadable
ERROR callhome.chassis.hitemp: Call home for CHASSIS OVER TEMPERATURE
EMERGENCY monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1. Chassis temperature is too high..
ERROR monitor.temp.unreadable: The controller temperature (PSU2 FB Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU2 Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU2 Inlet) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU1 FB Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU1 Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU1 Inlet) is not readable.
ERROR callhome.chassis.ps.degraded: Call home for CHASSIS POWER SUPPLY DEGRADED: PS 1
EMERGENCY monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1.
ERROR callhome.chassis.power: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU2, PSU1.
ALERT monitor.chassisPower.degraded: Chassis power is degraded: Power Supply Status Critical: PSU2,PSU1.
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 2 is degraded: PSU2 Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 2 is degraded: PSU2 Inlet is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Out Fault is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Warning is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 In Fault is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Fan is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 FB Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Inlet is Unreadable
- All sensors are ok in the partner node:
::> system controller environment show
Node FRU Name State
------------------ ------------------------------ -----------
netappv17-01 PSU2 GOOD
netappv17-01 PSU1 GOOD
netappv17-02 PSU2 unknown
netappv17-02 PSU1 unknown
- One embedded NSM8E module reports:
[dsa_worker3: ses.status.enclWarn:error]: NS224NSM8E (S/N SHJHU0123456789) shelf 0 on channel 0s disk enclosure warning for Enclosure 1: VPD EEPROMs mismatch or unreadable. This element is on the unknown location.
[dsa_worker3: ses.status.ModuleWarn:alert]: NS224NSM8E (S/N SHJHU0123456789) shelf 0 on channel 0s PCI switch warning for PCI Switch 2: non-critical status; Backplane VPD SEEROM corrupt or unreadable. This element is on the rear of the shelf at the bottom, on shelf module (B).
storage show fault
output:
Enclosure Status: non-critical
Channel: 0s
Shelf: 0
Shelf Type: NS224NSM8E
Module Type: NSM8E
Enclosure:
Element Status Status Bytes Status Descriptions
1: NONCRITICAL 03,00,00,00
PSM:
Element Status Status Bytes Status Descriptions
1 [NSM8E A] : NONCRITICAL 03,0C,00,00 MIDPLANE VPD FAULT, MASTER
2 [NSM8E B] : NONCRITICAL 03,04,00,00 MIDPLANE VPD FAULT
- Issue remains after PSUs and node re-seat.
- Issue remains with a known working PSU.
- Issue follows one chassis slot, when swapping controllers between them.