ANDU paused due to a T6 card error in a MetroCluster IP
Applies to
- ONTAP 9
- MetroCluster IP
- Automatic Non-Disruptive Upgrade (ANDU)
- Dual 40/100G Ethernet T62100-CR Card
- No takeover
Issue
- Shortly after ANDU ONTAP upgrade on a MetroCluster IP system started, the T6 card on one of the nodes reports a fatal error:
Tue Apr 13 11:00:26 +0100 [NodeA_01: intr: netif.fatal.err:alert]: The network device in slot 1 encountered fatal error e1a/e1b.
- Both ports on the T6 card change their state to
offline
:
slot 1: Dual 40/100G Ethernet T62100-CR
e1a MAC Address: 00:07:43:6f:ee:b0 (auto-unknown-fd-down)
e1b MAC Address: 00:07:43:6f:ee:b8 (auto-unknown-fd-down)
Device Type: T6 2
Firmware Version: 1.25.0.42
Part Number: 110122860E0
Hardware Revision: 0
Serial Number: PT12345678
- ONTAP ANDU is paused due to takeover failed:
[?] Tue Apr 13 11:00:26 +0100 [NodeA_01: upgrademgr: upgrademgr.update.pausedErr:error]: The automated update of the cluster has been paused due to the following reason: Node "NodeA_02": Error: {Takeover failed.}, Action: {Use the "storage failover takeover -ofnode NodeA_01" command to trigger the takeover of the node "NodeA_01".}.
- Both nodes are running and serving data, takeover didn't occur:
ClusterA::> storage failover show
Takeover
Node Partner Possible State Description
-------------- -------------- -------- -------------------------------------
ClusterA-01 ClusterA-02 false Waiting for ClusterA-02, Takeover is
not possible: Storage failover
mailbox disk state is invalid,
Storage failover interconnect error,
NVRAM log not synchronized
ClusterA-02 ClusterA-01 false Waiting for ClusterA-01, Takeover is
not possible: Storage failover
interconnect error, NVRAM log not
synchronized
2 entries were displayed.