Multiple SHELF_FAULT and SHELF COOLING UNIT FAILED reported by ONTAP
Applies to
- ONTAP 9
- DS460C (DS460-12), NS224 NSM100
Issue
- Multiple SHELF_FAULT and SHELF COOLING UNIT FAILED seen in AutoSupports:
HA Group Notification (SHELF_FAULT) ERROR.
HA Group Notification (SHELF COOLING UNIT FAILED) EMERGENCY
- The shelf fault and shelf cooling unit failed errors clear on their own, and the status returns to normal within a short time.
Example:
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 1: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 2: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 3: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 4: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 5: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 6: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 7: normal status.
[?] Mon Feb 20 19:47:52 +0800 [n19911002-01: dsa_worker5: ses.status.fanInfo:info]: DS460-12 (S/N xxxx) shelf 20 on channel 0a cooling fan information for Cooling element 8: normal status.
[?] Mon Feb 20 19:48:00 +0800 [n19911002-01: monitor: monitor.globalStatus.ok:notice]: The system's global status is normal.
[?] Tue Feb 21 10:36:51 +0800 [n19911002-01: statd: monitor.shelf.fault.ok:notice]: Fault previously reported on disk storage shelf attached to channel 0a has been corrected.
[?] Tue Feb 21 10:37:00 +0800 [n19911002-01: monitor: monitor.globalStatus.ok:notice]: The system's global status is normal.
- All or some of the cooling units are reported with errors in the environment output:
Cooling Unit installed element list: 1, 2, 3, 4, 5, 6, 7, 8; with error: 1, 2, 3, 4, 5, 6, 7, 8
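When reviewing many shelves, it can help to pull the failed cooling elements out of this line programmatically. The following is a minimal Python sketch, not a NetApp tool: the helper name `parse_cooling_units` is hypothetical, and the line format is assumed to match the example above exactly (a healthy shelf may print the error list differently).

```python
import re

# Assumed format, taken from the example line above:
# "Cooling Unit installed element list: 1, 2, ...; with error: 1, 2, ..."
LINE_RE = re.compile(
    r"Cooling Unit installed element list:\s*(?P<installed>[\d,\s]*);"
    r"\s*with error:\s*(?P<errors>[\d,\s]*)"
)


def parse_cooling_units(line):
    """Return (installed, errors) as lists of ints, or None if the line does not match."""
    match = LINE_RE.search(line)
    if match is None:
        return None

    def to_ints(text):
        # Split on commas and skip empty fragments (for example, an empty error list).
        return [int(part) for part in text.split(",") if part.strip()]

    return to_ints(match.group("installed")), to_ints(match.group("errors"))


if __name__ == "__main__":
    sample = (
        "Cooling Unit installed element list: 1, 2, 3, 4, 5, 6, 7, 8; "
        "with error: 1, 2, 3, 4, 5, 6, 7, 8"
    )
    installed, errors = parse_cooling_units(sample)
    if installed and set(errors) == set(installed):
        print("All installed cooling elements report an error")
    elif errors:
        print("Cooling elements with errors:", errors)
```

If every installed element reports an error at the same time, as in the example, that matches the symptom described in this article rather than a single failed fan.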