CONTAP-350775: Cluster Network Disruption and Unexpected Reboot due to Cluster Session Teardown Deadlock
Issue
A rare deadlock can occur in ONTAP systems during teardown of Cluster Session Manager's (CSM) sessions, resulting in network thread hangs and potential node panic. The problem is specific to CSM's Remote Direct Memory Access (RDMA) sessions, which is comprised of multiple connections to remote system. This issue is triggered by a race condition when multiple CSM RDMA connections are closed simultaneously, such as during network instability or cluster port flapping. The defect may lead to remote data access service disruption.
