CONTAP-682548: Hyper-V VMs hang during ONTAP upgrade due to CSM session failure and CIFS lock drops
Issue
- Hyper-V virtual machines (VMs) become unresponsive during an ONTAP cluster upgrade, specifically during the giveback phase of a node failover
- All Hyper-V hosts accessing CIFS/SMB shares lose storage connectivity and VMs hang
- Forced reboot of Hyper-V hosts is required to recover VM responsiveness
- The following ONTAP EMS messages are observed:
lmgr.session.failed: Dropped session-scoped locks due to a failed session
csm.createSessionFailed: CSM failed to create session with RDMA_RoCEv2
csm.ctFallbackActiveOpen: CSM could not create RDMA connections, falling back to CT (TCP)
csm.sessionFailed: Session failed with INCONSISTENT_TRANSPORT or CSM_CONNABORTED
- The following Windows Event Log entries are observed on Hyper-V hosts:
Disk Event ID 153 – I/O operation retried due to timeout
Kernel-Power Event ID 41 / 6008 – Unexpected system shutdown
