CONTAP-252261: Drive failure causing long IO delays
Issue
When a drive returns a non-retriable IO error, ONTAP may incorrectly keep retrying the IO. As a result, ONTAP takes a long time to fail the drive, and the resulting IO delays can affect clients such as ESX.
--raid.label.io.writeError:notice]: Label write on Disk /aggr1/plex1/rg1/0v.i1.3L1P1 ... failed with storage error disk operation timed out
--wafl_exempt02: wafl.cp.toolong:error]: Aggregate aggr1 experienced a long CP.
--kernel: Nblade.nfsLongRunningOp:debug]: Detected a long running network process operation...
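
To check whether a saved log shows the same pattern, the messages above can be searched for by event name. The following is a minimal sketch, not a NetApp tool. It assumes the log has been exported to plain text and that each message contains "event:severity]" as in the samples above. The function names find_symptoms and all_symptoms_present are hypothetical.

```python
import re

# Event names taken from the sample messages listed above.
EVENTS = (
    "raid.label.io.writeError",
    "wafl.cp.toolong",
    "Nblade.nfsLongRunningOp",
)

# Assumption: each message contains "<event>:<severity>]" as in the samples.
PATTERN = re.compile(r"(%s):(\w+)\]" % "|".join(re.escape(e) for e in EVENTS))


def find_symptoms(lines):
    """Return (event, severity, line) tuples for lines mentioning a listed event."""
    hits = []
    for line in lines:
        match = PATTERN.search(line)
        if match:
            hits.append((match.group(1), match.group(2), line.rstrip("\n")))
    return hits


def all_symptoms_present(lines):
    """Return True only when every listed event appears at least once."""
    seen = {event for event, _, _ in find_symptoms(lines)}
    return seen == set(EVENTS)
```

A match on all three events is only a hint that the log resembles this issue. Each of these messages can also be logged for unrelated reasons, so the result does not confirm the cause.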