Long Consistency Point "wafl.cp.toolong" when disk is about to fail or being failed
Applies to
- ONTAP 8 or newer.
- Disk in the process of failing or failed already.
Issue
- Event visible in the
"event log show"
command output:- ONTAP 9 or newer Example:
Mon Dec 23 00:20:36 EST [FilerA: wafl_exempt08: wafl.cp.toolong:error]: Aggregate fas_01_DATA_AGGR experienced a long CP...
- ONTAP 8 Example:
Mon Feb 22 16:14:08 CLT [FilerA: wafl_CP_proc: wafl.cp.toolong.warning:warning]: params: ...
Mon Feb 22 16:14:08 CLT [FilerA: wafl_CP_proc: wafl.cp.slovol.warning:warning]: params: ...
- ONTAP 9 or newer Example:
- The event log may also list the errors related to the disk.
Tue Oct 31 12:34:10 +1100 [Cluter1_01: disk_server_0: disk.ioRecoveredError.retry:info]: Recovered error on disk 0c.10.6: op0x2f:b5e0b400:0400 sector 7 SCSI:recovered error - Disk used internal retry algorithm to obtain data (1 b 96 96) (2693) Disk 0c.10.6 Shelf 10 Drawer 1 Slot 6 Bay 6 [NETAPP 377_WVAXE10TA07 NA01] S/N
[VHGUH40M] UID [5000CCA0:C82E5848:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]
Tue Oct 31 12:34:10 +1100 [Cluter1_01: disk_server_0: disk.IO.status:debug]: params: {'deviceName': '0c.10.6', 'returnCode': '5',
'pathRetryCount': '0', 'adapterStatus': '0x0', cdb': '0x2f:b5e0b400:0400', 'basicTimeout': '10', 'iASCQ': '0x96', 'iSenseKey': '0x1', 'sSenseCode': '', 'ETime': '2693', 'iASC': '0xb', 'victimRetryCount': 0', 'sSenseKey': 'SCSI:recovered error', 'targetStatus': '0x2', 'disk_information': 'Disk 0c.10.6 Shelf 10 Drawer 1 Slot 6 Bay 6 [NETAPP X377_WVAXE10TA07 NA01] S/N [VHGUH40M] UID 5000CCA0:C82E5848:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]', 'retryCount': '0', 'pathsTried': '1', 'timeoutRetryCount': '0'}
- Latency in affected aggregate is usually detected.
- Consistency Point (CP) is taking longer than expected. More detailed explanation in FAQ: Consistency Point.