High latency during disk failure in ONTAP MCC-IP with ADP Partitioned Disks
Applies to
- ONTAP 9
- MetroCluster IP(MCC-IP)
- Advanced Disk Partitioning (ADP)
Issue
- During a disk failure event in an MCC-IP configuration with ADP, the affected volumes experienced high write latency.
- SQL Server reported I/O requests taking longer than 15 seconds, leading to transaction errors and temporary service disruptions.
- Example log output:
Mon Sep 15 18:53:19 +0800 [SiteB-01:scsi_cmdblk_strthr_admin:disk.timeout.flush.start:notice]: Aggressive timeout flush started on disk 0v.i2.1L31 S/N 44L0A17HTZ1H. Details: PowerCycled: true Reason: DiskSpasm.Mon Sep 15 18:53:55 +0800 [SiteB-01:config_thread:raid.disk.offline:notice]: Marking Disk /SiteB_DATA_1/plex2/rg0/0v.i2.1L31P1 Shelf 0 Bay 25 offline.- SQL Server Error Log:
SQL Server has encountered 1 occurrence(s) of I/O requests taking longer than 15 seconds to complete on file templog.ldf (transaction log).