T6 cluster port hung leading to cluster network issues
Applies to
- AFF A800
- e0a/e0b onboard ports
Issue
- Affected node's cluster port was reporting admin up but reporting degraded and not functioning correctly
Node: Node1
Ignore
Speed(Mbps) Health Health
Port IPspace Broadcast Domain Link MTU Admin/Oper Status Status
--------- ------------ ---------------- ---- ---- ----------- -------- ------
e0M Default mgmt up 1500 auto/1000 healthy false
e0a Cluster Cluster up 9000 auto/100000 degraded false
- Cluster LIF home on this port was migrated to the failover target:
Node1_clus1 up/up 169.254.123.123/16 Node1 e1a false
Node1_clus2 up/up 169.254.124.124/16 Node1 e1a true
- Attempt to bounce the port results in the following error:
cluster::*> net port modify -node Node1 -port e0a -up-admin false
(network port modify)
Warning: Changes to a cluster port can affect the health of the Cluster. Are you sure you want to continue? {y|n}: y
Error: command failed: Timeout: Operation "vifmgr_netports_iterator::modify_imp()" took longer than 45 seconds to complete [from mgwd on node "node1" (VSID: -1) to vifmgr at 127.0.0.1]
- The following error is seen in the console logs:
destroy_cq - Device t6nex0 not responding after 60.000779 seconds - tid 0 qpid 0
destroy_cq - Device t6nex0 not responding after 300.019900 seconds - tid 0 qpid 0
destroy_cq - Device t6nex0 not responding after 1260.040793 seconds - tid 0 qpid 0
destroy_cq - Device t6nex0 not responding after 5100.061604 seconds - tid 0 qpid 0
destroy_cq - Device t6nex0 not responding after 20460.081438 seconds - tid 0 qpid 0