node down following multiple unreadable sensors and fails to boot
Applies to
- AFF-A300
- FAS8200
- BIOS below version 11.20
Issue
- Node shows health alerts on multiple FANs
::*> system health alert show
Node: node-02
Resource: 021907025432
Severity: Critical
Indication Time: Sat Dec 23 22:10:33 2023
Suppress: false
Acknowledge: false
Probable Cause: Fan2 has multiple faults. The nodes in this chassis
are node-02, node-01.
Possible Effect: The FRU Fan2 might stop functioning soon. The nodes in
the chassis might not function effectively or redundancy might be lost.
Corrective Actions: 1. Check Fan2 for failures. If necessary replace Fan2 as soon as possible.
2. Refer to the Hardware specification guide for more information on the position of the field-replaceable unit (FRU) and ways to check or replace it.
3. Contact support personnel if the alert persists.
- Node shows multiple sensors are at faulted state or not available
::*> system node environment sensors show
PSU2 fault MULTIFAULT
PSU1 fault MULTIFAULT
Fan3 fault MULTIFAULT
Fan2 fault MULTIFAULT
Fan1 fault MULTIFAULT
Sysfan2 Present normal PRESENT
Sysfan1 Fault fault FAULT
Sysfan1 F1 Speed failed - RPM 1470 1560 - -
Sysfan1 F2 Speed failed - RPM 1470 1560 - -
Sysfan2 Present normal PRESENT
Sysfan2 Fault fault FAULT
Sysfan2 F1 Speed failed - RPM 1470 1560 - -
Sysfan2 F2 Speed failed - RPM 1470 1560 - -
Sysfan3 Present normal PRESENT
Sysfan3 Fault fault FAULT
Sysfan3 F1 Speed failed - RPM 1470 1560 - -
Sysfan3 F2 Speed failed - RPM 1470 1560 - -
PSU1 Present normal PRESENT
PSU1 Temp not-available - C 0 5 50 60
PSU1 Curr not-available - mA - - - -
PSU1 Fan1 Speed not-available - RPM 4500 4600 - -
PSU1 Fan1 Fault not-available
PSU1 Fan2 Speed not-available - RPM 4500 4600 - -
PSU1 Fan2 Fault not-available
PSU1 Pwr In OK normal OK
PSU1 Pwr Out OK normal OK
PSU1 FAULT normal OK
PSU1 Over Temp not-available
PSU1 Over Volt not-available
PSU1 Over Curr not-available
PSU1 Crest Factor not-available - 1000 - 1728 2000
PSU1 InPwr Monitor not-available - mW - - - -
PSU2 Present normal PRESENT
PSU2 Temp not-available - C 0 5 50 60
PSU2 Curr not-available - mA - - - -
PSU2 Fan1 Speed not-available - RPM 4500 4600 - -
PSU2 Fan1 Fault not-available
PSU2 Fan2 Speed not-available - RPM 4500 4600 - -
PSU2 Fan2 Fault not-available
PSU2 Pwr In OK normal OK
PSU2 Pwr Out OK normal OK
PSU2 FAULT normal OK
PSU2 Over Temp not-available
PSU2 Over Volt not-available
PSU2 Over Curr not-available
PSU2 Crest Factor not-available - 1000 - 1728 2000
PSU2 InPwr Monitor not-available - mW - - - -
- After controller reseat, node reach to
LOADER>
but fails to boot with:
Waiting for SP ...
IPMI:Read midplane FRU 1 common header:failed
Configuring Devices ...
...
BIOS POST Failure(s) detected: Failed to get FRU data. Abort AUTOBOOT
LOADER-B>