ESX Hosts losing access to LUNs regularly along with Aggregate reporting long CP's
Applies to
- ONTAP 9.x
- VMware ESXi 6.7 (Update 3)
- Cisco UCS B200M4/5
Issue
- VMware critical alert for datastore path down
- Poor performance and alerts for SQL virtual machines
vmhba0:C0:T66:L43 is down. Affected datastores: Path redundancy to storage device naa.xyz degraded. Path vmhba1:C0:T78:L10 is down. Affected datastores:
- ONTAP reports long or back2back consistency points (CP)
wafl_exempt00: wafl.cp.toolong:error]: Aggregate_xyz_sata_aggr1 experienced a long CP.
WARNING: ScsiDeviceIO: 12345: Device naa.123456 performance has deteriorated. I/O latency increased from average value of 8000 microseconds to 1000008 microseconds