ONTAP FlexArray WAFL hung after large increase in workload
Applies to
- ONTAP 9.x
- E-Series
Issue
- WAFL Panic and node reboot
- E-Series reports Fibre Channel errors
- Extreme performance degradation
- Very high latency, or "not available", reported for volumes and LUNs
- SCSI transport error against one port: "Disk device 0z.1L00"
- Node experienced a long consistency point (CP)
Example EMS messages:
WAFL hung for <node>. in SK process wafl_exempt07 on release 9.6P5
ERROR scsi.cmd.transportErrorEMSOnly: Disk device : Transport error during execution of command: HA status
wafl.cp.toolong: Aggregate xyz experienced a long CP
mlm.non.optimized.TPusage: Detected non-optimized usage of a array's target port
[cf.multidisk.fatalProblem:info]: Node encountered a multidisk error or other fatal error while waiting to be taken over
"Host-side: controller in slot B, port <port>: Fiber channel link errors - threshold exceeded"
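When checking whether a node shows this combination of symptoms, it can help to search an exported log for the event names listed above. The following is a minimal sketch, not a NetApp tool: the function name `find_symptom_events` and the idea of passing the log as one plain-text string are assumptions made for illustration, and the event names are copied from the example messages in this article.

```python
# Event names copied from the example EMS messages in this article.
EVENT_NAMES = [
    "scsi.cmd.transportErrorEMSOnly",
    "wafl.cp.toolong",
    "mlm.non.optimized.TPusage",
    "cf.multidisk.fatalProblem",
]

# The "WAFL hung" example carries no dotted event name, so match the phrase.
WAFL_HUNG_PHRASE = "WAFL hung"


def find_symptom_events(log_text):
    """Map each symptom (hypothetical helper) to the log lines that mention it."""
    hits = {name: [] for name in EVENT_NAMES}
    hits[WAFL_HUNG_PHRASE] = []
    for line in log_text.splitlines():
        stripped = line.strip()
        for key in hits:
            if key in stripped:
                hits[key].append(stripped)
    return hits


if __name__ == "__main__":
    sample = (
        "wafl.cp.toolong: Aggregate xyz experienced a long CP\n"
        "unrelated line\n"
        "WAFL hung for <node>. in SK process wafl_exempt07 on release 9.6P5\n"
    )
    for symptom, lines in find_symptom_events(sample).items():
        print(f"{symptom}: {len(lines)} match(es)")
```

A plain substring match is used here because the surrounding log format (timestamps, node prefixes, severity tags) varies between sources. Several of these events appearing together in a short time window is what matches the issue described above.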