CONTROLLER TAKEOVER COMPLETE AUTOMATIC due to power glitch
Applies to
- ONTAP 9
- AFF systems
- FAS systems
Issue
- The partner node experienced an automatic takeover. The following messages can be found in
EMS-LOG-FILE:
callhome.sfo.takeover:alert]: Call home for CONTROLLER TAKEOVER COMPLETE AUTOMATIC
cf_takeover: callhome.reboot.takeover:notice]: Call home for PARTNER REBOOT (CONTROLLER TAKEOVER)
cf_takeover: cf.fm.takeoverComplete:notice]: Failover monitor: takeover completed
splog_main: mgr.boot.reason_abnormal:EMERGENCY]: System rebooted due to a power glitch.
splog_main: callhome.reboot.glitch:notice]: Call home for REBOOT (power glitch)
- The shelves also reported a power failure, as seen in
EMS-LOG-FILE:
cf_hwassist: cf.hwassist.takeoverTrapRecv:notice]: hw_assist: Received takeover hw_assist alert from partner(cluster1-01), system_down because power_loss.
dsa_worker5: ses.status.psWarning:error]: DS224-12 (S/N SHF#############) shelf 0 on channel 7a power warning for Power supply 2: warning status; DC undervoltage. This module is on the rear of the shelf at the bottom right.
dsa_worker4: ses.status.psWarning:error]: DS224-12 (S/N SHF#############) shelf 10 on channel 9d power warning for Power supply 2: warning status; DC undervoltage. This module is on the rear of the shelf at the bottom right.
Sat Jan 11 00:30:40 +0100 [snes1p208_01: dsa_worker0: callhome.shlf.power.intr:error]: Call home for SHELF POWER INTERRUPTED
- The ASUP HA Group Notifications also show the following:
HA Group Notification (CHASSIS POWER SUPPLY DEGRADED: PSU3) ERROR
HA Group Notification (CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU3.) ERROR
HA Group Notification (SHELF POWER INTERRUPTED) ERROR
HA Group Notification (SHELF_FAULT) ERROR
- The
SP-LATEST-SYSTEM-EVENT-LOG
shows the following:
Record 589: Fri Aug 06 19:01:37.000000 2021 [SP.emergency]: System input power lost
Record 590: Thu Jan 01 00:00:49.400961 1970 [IPMI.notice]: 7204 | c0 | OEM: ffff7000ff00 | ManufId: 150300 | SP Power Reset
Record 591: Thu Jan 01 00:00:49.450536 1970 [IPMI.notice]: 7304 | c0 | OEM: fcff70560000 | ManufId: 150300 | POS Register: Power on Reset(Normal Power Cycle)
Record 407: Fri Feb 28 03:34:17.489482 2020 [Agent.notice]: 127.880: 3 : AC Power Loss Signal PSU1 de-asserted
Record 408: Fri Feb 28 03:34:17.489664 2020 [Agent.notice]: 128.100: 4 : AC Power Loss Signal PSU2 de-asserted
Record 409: Fri Feb 28 03:34:17.557708 2020 [Agent.notice]: 196.145: 4 : AC Power Loss Signal PSU2 asserted
Record 410: Fri Feb 28 03:34:17.570049 2020 [Agent.notice]: 208.526: 3 : AC Power Loss Signal PSU1 asserted
Record 411: Fri Feb 28 03:34:17.635848 2020 [Agent.notice]: 274.301: 14 : Attention LED (at Midplane) asserted
Record 412: Fri Feb 28 03:34:23.431854 2020 [Agent.notice]: 070.290: 14 : Attention LED (at Midplane) de-asserted
Record 413: Fri Feb 28 03:34:27.516634 2020 [SP.warning]: AC_OK Low Detected
Record 419: Fri Feb 28 03:39:47.942198 2020 [SP.critical]: Filer Reboots
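When checking a system for the same symptoms, the EMS and SP event log excerpts above can be filtered with a short script. The following is a minimal Python sketch, not a NetApp tool: the function names, the regular expressions, and the event list are assumptions derived only from the sample lines in this article, and real log files may be laid out differently.

```python
import re

# EMS event names copied from the excerpts in this article (not an exhaustive list).
POWER_EVENTS = {
    "callhome.sfo.takeover",
    "callhome.reboot.takeover",
    "cf.fm.takeoverComplete",
    "mgr.boot.reason_abnormal",
    "callhome.reboot.glitch",
    "cf.hwassist.takeoverTrapRecv",
    "ses.status.psWarning",
    "callhome.shlf.power.intr",
}

# Assumed layout, based on the samples: "<event.name>:<severity>]: <message>"
EMS_RE = re.compile(r"([A-Za-z_][\w.]*):(\w+)\]:\s*(.*)")

# Assumed layout, based on the samples:
# "Record <n>: <timestamp> [<source>.<severity>]: <message>"
SEL_RE = re.compile(r"Record (\d+): (.+?) \[(\w+)\.(\w+)\]: (.*)")


def find_power_events(lines):
    """Return (event, severity, message) for EMS lines naming a listed event."""
    hits = []
    for line in lines:
        m = EMS_RE.search(line)
        if m and m.group(1) in POWER_EVENTS:
            hits.append((m.group(1), m.group(2), m.group(3)))
    return hits


def parse_sel_record(line):
    """Split one SP event log record into its fields, or return None."""
    m = SEL_RE.search(line)
    if m is None:
        return None
    return {
        "record": int(m.group(1)),
        "timestamp": m.group(2),  # kept as text; no date parsing is attempted
        "source": m.group(3),
        "severity": m.group(4),
        "message": m.group(5),
    }
```

For example, passing the lines of an exported EMS log to find_power_events() returns only the takeover and power-related entries, and parse_sel_record() makes it easy to pick out SP records with a severity such as emergency or critical.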