sas.device.timeout error occurs after disk.ioRecoveredError
Applies to
Issue
IO recover occurs after IO problem.
[?] Tue Jul 23 17:25:02 +0900 [NodeA: disk_server_0: disk.ioRecoveredError.retry:info]: Recovered error on disk 0x.XX.XX: op 0x2f:37776800:0400 sector 0 SCSI:recovered error - Disk used internal retry algorithm to obtain data (1 b 4 0) (49) Disk 0x.XX.XX Shelf X Bay XX [NETAPP X342_SSKBE1T2A10 NA02] S/N [XXXXXXXX] UID [X000XX00:XXXXXXXX:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]
[?] Tue Jul 23 17:25:02 +0900 [NodeA: disk_server_0: disk.IO.status:debug]: params: {'deviceName': '0x.XX.XX', 'ETime': '49', 'cdb': '0x2f:37776800:0400', 'victimRetryCount': '0', 'retryCount': '0', 'timeoutRetryCount': '0', 'pathRetryCount': '0', 'adapterStatus': '0x0', 'targetStatus': '0x2', 'sSenseKey': 'SCSI:recovered error', 'sSenseCode': '', 'iSenseKey': '0x1', 'iASC': '0xb', 'iASCQ': '0x4', 'pathsTried': '1', 'basicTimeout': '5', 'returnCode': '5', 'disk_information': 'Disk 0x.XX.XX Shelf X Bay XX [NETAPP X342_SSKBE1T2A10 NA02] S/N [XXXXXXXX] UID [X000XX00:XXXXXXXX:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000]'}
Disk test is run in Maintenance Center due to IO recover error.
Maintenance Center test result is successful.
[?] Thu Jul 25 05:58:47 +0900 [NodeA: pmcsas_timeout_0: sas.device.timeout:error]: Adapter 0x encountered a device timeout on Disk device 0x.XX.XX.
[?] Thu Jul 25 05:59:02 +0900 [NodeA: disk_admin: disk.outOfService:notice]: Drive 0x.XX.XX (WFK8A82P): exceeded latency threshold. Power-On Hours: 32653, GList Count: 0, Drive Info: Disk 0x.XX.XX Shelf X Bay XX [NETAPP X342_SSKBE1T2A10 NA02] S/N [XXXXXXXX] UID [X000XX00:XXXXXXXX:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000].
......
[?] Thu Jul 25 05:59:02 +0900 [NodeA: config_thread: raid.disk.maint.start:notice]: Disk /aggrX/plex0/rg1/0x.XX.XX Shelf X Bay XX [NETAPP X342_SSKBE1T2A10 NA02] S/N [XXXXXXXX] UID [X000XX00:XXXXXXXX:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000] will be tested.
......
[?] Thu Jul 25 10:33:22 +0900 [NodeA: config_thread: raid.disk.maint.done:notice]: Disk 0x.XX.XX Shelf X Bay XX [NETAPP X342_SSKBE1T2A10 NA02] S/N [XXXXXXXX] UID [X000XX00:XXXXXXXX:00000000:00000000:00000000:00000000:00000000:00000000:00000000:00000000] tests were completed successfully.
Then this disk is set as a spare disk for continue using.