Node unhealthy after controller upgrade
Applies to
- Controller upgrade
- AFF8060 to AFF A700
- ONTAP 9.7P6
- Cisco Nexus 3132 Cluster Network Switches
Issue
- After completing a controller upgrade from AFF8060 to AFF A700, the new node shows as not healthy
cluster::> cluster show
Node                  Health  Eligibility
--------------------- ------- ------------
node01                false   true
node02                true    true
node03                true    true
node04                true    true
- Cluster applications fail to come online on the new controller
cluster::> storage failover show
                              Takeover
Node           Partner        Possible State Description
-------------- -------------- -------- -------------------------------------
node01         -              false    Waiting for partner. Waiting for
                                       cluster applications to come online
                                       on the local node. Offline
                                       applications: mgmt, vldb, vifmgr,
                                       bcomd, crs, scsi blade, clam.
                                       Takeover is not possible: Storage
                                       failover is disabled
node02         node01         false    Waiting for node01, Takeover is
                                       not possible: Storage failover is
                                       disabled
node03         node04         true     Connected to node04
node04         node03         true     Connected to node03
- When connected directly to the node management LIF of the replaced node, all nodes in the cluster show as unhealthy
cluster::> cluster show
Node                  Health  Eligibility
--------------------- ------- ------------
node01                false   true
node02                false   true
node03                false   true
node04                false   true
- Migrating all cluster LIFs so that they run over a single cluster switch brings the node back to a healthy state
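
The health checks above can be scripted when comparing the view from several management LIFs. The following is a minimal Python sketch, not a NetApp tool: it assumes the three-column "cluster show" layout shown in this article (Node, Health, Eligibility) has been captured as text, and the function name unhealthy_nodes is hypothetical. Output captured at a different privilege level may contain extra columns and would need a different parser.

```python
def unhealthy_nodes(cluster_show_output: str) -> list[str]:
    """Return the names of nodes whose Health column reads 'false'.

    Assumes each data row has exactly three whitespace-separated fields:
    node name, health, eligibility (the layout shown in this article).
    """
    nodes = []
    for line in cluster_show_output.splitlines():
        fields = line.split()
        # Skip blank lines and anything that is not a three-field row.
        if len(fields) != 3:
            continue
        name, health, eligibility = fields
        # Skip the prompt, header, and separator lines, which also
        # happen to split into three fields but are not true/false rows.
        if health not in ("true", "false") or eligibility not in ("true", "false"):
            continue
        if health == "false":
            nodes.append(name)
    return nodes


# Sample text copied from the first "cluster show" output in this article.
sample = """cluster::> cluster show
Node                  Health  Eligibility
--------------------- ------- ------------
node01                false   true
node02                true    true
node03                true    true
node04                true    true
"""

print(unhealthy_nodes(sample))
```

Running the same function against output captured from the replaced node's own management LIF would list all four nodes, which matches the second symptom described above.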