ANDU fails due to Error: Takeover failed
Applies to
- ONTAP 9
- Automated Non-Disruptive Upgrade (ANDU)
Issue
- ANDU on node1 fails due to
Error: Takeover failed., Action: Use the "storage failover takeover -ofnode node1" command to trigger the takeover of the node "node1"
. - Auto-triggered ASUPs are as below.
HA Group Notification (REBOOT (after giveback)) NOTICE node2 9.9.1P16 Cluster-Mode
HA Group Notification (AUTOMATED NDU PAUSED) ALERT node2 9.9.1P16 Cluster-Mode
HA Group Notification (AUTOMATED NDU PAUSED) ALERT node1 9.8P5 Cluster-Mode
HA Group Notification (HA GROUP ERROR: DISK/SHELF COUNT MISMATCH) ERROR node2 9.9.1P16 Cluster-Mode
HA Group Notification (HA GROUP ERROR: DISK/SHELF COUNT MISMATCH) ERROR node1 9.8P5 Cluster-Mode
HA Group Notification (FILESYSTEM DISK NOT RESPONDING) ERROR node2 9.9.1P16 Cluster-Mode
HA Group Notification (CONTROLLER GIVEBACK COMPLETE) NOTICE node1 9.8P5 Cluster-Mode
HA Group Notification (PARTNER REBOOT (CONTROLLER TAKEOVER)) NOTICE node2 9.8P5 Cluster-Mode
HA Group Notification (CONTROLLER TAKEOVER COMPLETE MANUAL) NOTICE node1 9.8P5 Cluster-Mode
storage failover show –instance –node node2
or ASUP ->STORAGE-FAILOVER.XML
of node2 showsReason Takeover not Possible
as below:- The version of software running on each node of the SFO pair is incompatible
- NVRAM log not synchronized
- Local node missing partner disks
- ASUP ->
SYSCONFIG-R
shows below:
Broken disks
RAID Disk Device HA SHELF BAY CHAN Pool Type RPM Used (MB/blks) Phys (MB/blks)
--------- ------ ------------- ---- ---- ---- ----- -------------- --------------
not responding 0b.00.6P2 0b 0 6 SA:B 0 SAS 10000 23956/49062912 23964/49079296
not responding 0b.00.6 0b 0 6 SA:B 0 SAS 10000 1713523/3509295616 1716957/3516328368