Controller shutdown with multiple chassis fan failed warnings, but only one chassis fan FRU is showing failed
Applies to
- ONTAP 9
- AFF / ASA / FAS
Issue
- Controller shuts down with multiple fan failure even though only one field-replaceable unit (FRU) needs a replacement.
- Autosupports similar to the following can be generated:
HA Group Notification (MULTIPLE CHASSIS FAN FAILED: System will shut down in 2 minutes) ERROR
HA Group Notification (Health Monitor process nphm: NphmCriticalFanFruFaultAlert[xxxxxxxxxxxx]) CRITICAL
HA Group Notification (Health Monitor process cphm: CriticalFruMultiFaultAlert[xxxxxxxxxxxx]) ALERT
HA Group Notification (CHASSIS FAN FRU FAILED: SysFan1 F1) ERROR
HA Group Notification (CHASSIS FAN FRU FAILED: SysFan1 F2) ERROR
- EMS errors for chassis SysFan failure:
Jun 05 00:35:42 [cluster-n01:monitor.chassisFanFail.xMinShutdown:EMERGENCY]: Multiple Chassis Fan failure: System will shut down in 2 minutes.
Jun 05 00:36:00 [cluster-n01:monitor.globalStatus.critical:EMERGENCY]: Multiple fans has failed: SysFan1 F2, SysFan1 F1.
Jun 05 00:38:07 [cluster-n01:monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Multiple fans failed)
- EMS errors for chassis IOfan failure:
Feb 27 14:06:51 [cluster-n01:monitor.chassisFanFail.xMinShutdown:EMERGENCY]: Multiple Chassis Fan failure: System will shut down in 2 minutes.
Feb 27 14:07:00 [cluster-n01:monitor.globalStatus.critical:EMERGENCY]: Multiple fans has failed: IOfan3 F1, IOfan3 F2.
Feb 27 14:09:21 [cluster-n01:monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Multiple IO fans failed)
- SP/BMC system sensors appear FAN1_1 and Fan1_2 are
na
status.
Fan1_1 | na | RPM | na | na | 600.000 | 900.000 | na | na | na
Fan2_1 | 12600.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan3_1 | 12600.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan4_1 | 12600.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan1_2 | na | RPM | na | na | 600.000 | 900.000 | na | na | na
Fan2_2 | 12700.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan3_2 | 12700.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan4_2 | 12700.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
- Already ruled out platform specific issues described in:
- Fan module has been checked and is properly seated in the chassis.