NetApp Knowledgebase

e0a/e0b link flaps on A300/FAS8200, A200/FAS2600, A220/FAS2700, and C190 may cause a takeover

Applies to

  • AFF A300/FAS8200
  • AFF A200/FAS2600 (FAS2620, FAS2650)
  • AFF A220/FAS2700 (FAS2720, FAS2750)
  • AFF C190
  • ONTAP

Issue

  • The links on cluster ports e0a and e0b flap or go down at the same time.

Example:

Tue Oct 03 11:08:31 CEST [node1: ixgbe/e0b: snmp.link.down:info]: Interface 2 is down.
Tue Oct 03 11:08:31 CEST [node1: ixgbe/e0b: netif.linkDown:info]: Ethernet e0b: Link down, check cable.
Tue Oct 03 11:08:31 CEST [node1: ixgbe/e0a: snmp.link.down:info]: Interface 1 is down.
Tue Oct 03 11:08:31 CEST [node1: ixgbe/e0a: netif.linkDown:info]: Ethernet e0a: Link down, check cable.

Tue Oct 03 11:08:32 CEST [node2: ixgbe/e0b: snmp.link.down:info]: Interface 2 is down.
Tue Oct 03 11:08:32 CEST [node2: ixgbe/e0b: netif.linkDown:info]: Ethernet e0b: Link down, check cable.
Tue Oct 03 11:08:32 CEST [node2: ixgbe/e0a: snmp.link.down:info]: Interface 1 is down.
Tue Oct 03 11:08:32 CEST [node2: ixgbe/e0a: netif.linkDown:info]: Ethernet e0a: Link down, check cable.

  • From the command line, both cluster ports report the link as down:

cluster::> network port show -role cluster
(network port show)

Node: node1
                                                  Speed(Mbps) Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper  Status
--------- ------------ ---------------- ---- ---- ----------- --------
e0a       Cluster      Cluster          down 9000 1000/-      -
e0b       Cluster      Cluster          down 9000 1000/-      -

Node: node2
                                                  Speed(Mbps) Health
Port      IPspace      Broadcast Domain Link MTU  Admin/Oper  Status
--------- ------------ ---------------- ---- ---- ----------- --------
e0a       Cluster      Cluster          down 9000 1000/-      -
e0b       Cluster      Cluster          down 9000 1000/-      -
4 entries were displayed.

cluster::storage failover> storage failover show
                                     Takeover
Node              Partner            Possible      State Description
-------------     --------------     --------      -------------------------------------
cluster-01        cluster-02          false        Connected to cluster-02, Partial
                                                   giveback, Takeover is not possible:
                                                   The version of software running on
                                                   each node of the SFO pair is
                                                   incompatible, NVRAM log not synchronized
cluster-02        cluster-01            -          Waiting for cluster applications to
                                                   come online on the local node
                                                   Offline applications: mgmt, vldb,
                                                   vifmgr, bcomd, crs.
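
When reviewing collected EMS logs, the signature above (both cluster ports on a node losing link at the same timestamp) can be checked for with a short script. The following is a minimal Python sketch, assuming log lines in the same format as the example above; the function name `find_simultaneous_flaps` and the regular expression are illustrative assumptions and are not part of any NetApp tool.

```python
import re
from collections import defaultdict

# Matches netif.linkDown lines in the format shown in the example, e.g.:
# Tue Oct 03 11:08:31 CEST [node1: ixgbe/e0b: netif.linkDown:info]: Ethernet e0b: Link down, check cable.
# The pattern is an assumption based only on that sample format.
LINK_DOWN = re.compile(
    r"^(?P<ts>\w{3} \w{3} \d{2} \d{2}:\d{2}:\d{2} \w+) "
    r"\[(?P<node>[^:]+): [^/]+/(?P<port>e0[ab]): netif\.linkDown:info\]"
)


def find_simultaneous_flaps(lines):
    """Return sorted (timestamp, node) pairs where e0a and e0b both logged link down."""
    ports_down = defaultdict(set)
    for line in lines:
        match = LINK_DOWN.match(line.strip())
        if match:
            # Group the ports that went down by identical timestamp and node.
            ports_down[(match["ts"], match["node"])].add(match["port"])
    return sorted(
        key for key, ports in ports_down.items() if ports == {"e0a", "e0b"}
    )
```

The sketch only groups events that carry an identical timestamp string; flaps that straddle a second boundary would need a time-window comparison instead.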

If the ports do not come back up and the Connectivity, Liveliness and Availability Monitor (CLAM) is enabled:
  • An out-of-quorum panic occurs on one of the nodes.

Example:

PANIC  : Received PANIC packet from partner, receiving message is (Coredump and takeover initiated because Connectivity, Liveliness and Availability Monitor (CLAM) has determined this node is out of quorum.

  • The node that panics is taken over, and the surviving node serves all data.

If the ports do not come back up and the Connectivity, Liveliness and Availability Monitor (CLAM) is NOT enabled:
  • No takeover occurs and both nodes go out of quorum. Neither node serves data.
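
The two cases above can be summarized as a small decision function. This is a minimal Python sketch that restates only what this article describes for an HA pair whose cluster ports stay down; the `Outcome` type and the function name are hypothetical and do not correspond to any ONTAP API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Outcome:
    panic: bool              # one node panics because it is out of quorum
    takeover: bool           # the partner takes over the panicked node
    nodes_serving_data: int  # nodes in the HA pair still serving data


def outcome_if_cluster_ports_stay_down(clam_enabled: bool) -> Outcome:
    """Model the behavior described above when e0a/e0b do not come back up."""
    if clam_enabled:
        # CLAM panics one node; the surviving node serves all data.
        return Outcome(panic=True, takeover=True, nodes_serving_data=1)
    # Without CLAM, both nodes go out of quorum and neither serves data.
    return Outcome(panic=False, takeover=False, nodes_serving_data=0)
```

The function deliberately covers only the case where the ports remain down, because that is the only case this article describes.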

 


 

 
