CONTAP-350775: Cluster Network Disruption and Unexpected Reboot due to Cluster Session Teardown Deadlock
Issue
A rare deadlock can occur in ONTAP systems during teardown of Cluster Session Manager's (CSM) sessions, resulting in network thread hangs and/or a potentially unexpected reboot. The problem is specific to CSM's Remote Direct Memory Access (RDMA) sessions, which is comprised of multiple connections to remote system. This issue is triggered by a race condition when multiple CSM RDMA connections are closed simultaneously, such as during network instability or cluster port flapping. The defect may lead to remote data access service disruption.
