RDMA connection down during ONTAP Upgrade
Applies to
Issue
- During an ONTAP upgrade, one node upgrades to the new ONTAP version but fails to boot and continue with the takeover/giveback:
Cluster::*> cluster image show
Node                 Current Version         Installation Date
-------------------- ----------------------- -------------------
Node-01              9.16.1P6                xx/xx/xxxx 21:24:12
Node-02              9.16.1P11               xx/xx/xxxx 16:43:20
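The mixed-version state shown above can also be detected from saved CLI output. The following is a minimal Python sketch, not a NetApp tool: the function names `node_versions` and `is_mixed_version` are hypothetical, and it assumes each data row has the form node, version, date, time, as in the output above.

```python
def node_versions(text):
    """Map node name -> ONTAP version from 'cluster image show' text.

    Assumption: data rows have four whitespace-separated fields
    (node, version, date, time) and the version starts with a digit.
    """
    versions = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 4 and parts[1][:1].isdigit():
            versions[parts[0]] = parts[1]
    return versions


def is_mixed_version(versions):
    """True when the nodes do not all report the same version."""
    return len(set(versions.values())) > 1


SAMPLE_IMAGE_SHOW = """\
Node                 Current Version         Installation Date
-------------------- ----------------------- -------------------
Node-01              9.16.1P6                xx/xx/xxxx 21:24:12
Node-02              9.16.1P11               xx/xx/xxxx 16:43:20
"""

if __name__ == "__main__":
    found = node_versions(SAMPLE_IMAGE_SHOW)
    print(found, "mixed:", is_mixed_version(found))
```

The header and separator rows are skipped because they do not have four fields with a version-like second field.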
- The storage failover show output reports a storage failover interconnect error on both nodes:
Cluster::> sto fail show
(storage failover show)
Node           Partner        Possible State Description
-------------- -------------- -------- -------------------------------------
Node-01        Node-02        false    Waiting for Node-02, Takeover is
                                       not possible: Storage failover
                                       interconnect error, NVRAM log not
                                       synchronized, Disk inventory not
                                       exchanged
Node-02        Node-01        false    Waiting for Node-01, Takeover is
                                       not possible: Storage failover
                                       interconnect error, NVRAM log not
                                       synchronized, Disk inventory not
                                       exchanged
- The HA interconnect status shows that both links are up on each node, but the RDMA connection is down:
Cluster::*> ha interconnect status show
(system ha interconnect status show)
Node: Node-01
Link 0 Status: up
Link 1 Status: up
Is Link 0 Active: true
Is Link 1 Active: true
IC RDMA Connection: down
Node: Node-02
Link 0 Status: up
Link 1 Status: up
Is Link 0 Active: true
Is Link 1 Active: true
IC RDMA Connection: down
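
The signature described above (all interconnect links up, RDMA connection down) can be checked automatically against captured output. The following Python sketch is illustrative only: the function names `parse_ic_status` and `rdma_down_with_links_up` are hypothetical, and it assumes the "Field: value" layout shown in the output above.

```python
def parse_ic_status(text):
    """Return {node: {field: value}} from 'system ha interconnect status show' text.

    Assumption: each node section starts with a 'Node: <name>' line and is
    followed by 'Field: value' lines, as in the captured output.
    """
    nodes = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if ":" not in line:
            continue  # skip blank lines and the expanded-command echo
        key, value = (part.strip() for part in line.split(":", 1))
        if key == "Node":
            current = nodes.setdefault(value, {})
        elif current is not None:
            current[key] = value
    return nodes


def rdma_down_with_links_up(nodes):
    """List nodes whose link statuses are all 'up' while RDMA is 'down'."""
    affected = []
    for name, fields in nodes.items():
        links = [v for k, v in fields.items()
                 if k.startswith("Link") and k.endswith("Status")]
        rdma = fields.get("IC RDMA Connection")
        if links and all(v == "up" for v in links) and rdma == "down":
            affected.append(name)
    return affected


SAMPLE_IC_STATUS = """\
(system ha interconnect status show)
Node: Node-01
Link 0 Status: up
Link 1 Status: up
Is Link 0 Active: true
Is Link 1 Active: true
IC RDMA Connection: down
Node: Node-02
Link 0 Status: up
Link 1 Status: up
Is Link 0 Active: true
Is Link 1 Active: true
IC RDMA Connection: down
"""

if __name__ == "__main__":
    print(rdma_down_with_links_up(parse_ic_status(SAMPLE_IC_STATUS)))
```

With the sample text, both nodes are reported as affected, which matches the issue described in this article.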
