Skip to main content
NetApp Knowledge Base

CONTAP-350775: Cluster Network Disruption and Unexpected Reboot due to Cluster Session Teardown Deadlock

Views:
2
Visibility:
Public
Votes:
0
Category:
ontap-9
Specialty:
core
Last Updated:

Issue

A rare deadlock can occur in ONTAP systems during teardown of Cluster Session Manager's (CSM) sessions, resulting in network thread hangs and potential node panic. The problem is specific to CSM's Remote Direct Memory Access (RDMA) sessions, which is comprised of multiple connections to remote system. This issue is triggered by a race condition when multiple CSM RDMA connections are closed simultaneously, such as during network instability or cluster port flapping. The defect may lead to remote data access service disruption.

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.