SAS port reset with error "Failed to get a response in state 73"
Applies to
- ONTAP 9
Issue
The EMS log shows a disk timeout followed by a SAS port reset, along with the adapter exception "Failed to get a response in state 73".
Example:
09 Dec 2020 23:17:50 [node1: Debug] config_thread raid aggr log CP count: aggregate_type="Aggregate" aggregate_name="AGGR_R1APP602" aggregate_uuid="7a9131b4-63bb-4478-96f4-1f681292dc45" home_owner_id="538033388" home_owner_name="node1" CP_count="2226778"
09 Dec 2020 23:18:35 [node1: Debug] config_thread raid aggr log CP count: aggregate_type="Aggregate" aggregate_name="node1_aggr0" aggregate_uuid="80941bab-3b62-4ebb-842c-0ecd80d3f2a9" home_owner_id="538033388" home_owner_name="node1" CP_count="2984531"
09 Dec 2020 23:52:02 [node1: Error] scsi_cmdblk_strthr_admin scsi cmd checkCondition: deviceType="Disk" deviceName="0d.02.10" cdb="0x28:25fe9428:0008" sSenseKey="SCSI:aborted command" sSenseCode="" iSenseKey="0xb" iASC="0x2f" iASCQ="0x10" iFRU="0x0" DTime="4356"
09 Dec 2020 23:52:02 [node1: Error] scsi_cmdblk_strthr_admin scsi cmd checkCondition: deviceType="Disk" deviceName="0d.02.10" cdb="0x28:288a25e8:0008" sSenseKey="SCSI:aborted command" sSenseCode="" iSenseKey="0xb" iASC="0x2f" iASCQ="0x10" iFRU="0x0" DTime="4368"
09 Dec 2020 23:52:02 [node1: Error] scsi_cmdblk_strthr_admin scsi cmd checkCondition: deviceType="Disk" deviceName="0d.02.10" cdb="0x28:288a2930:0008" sSenseKey="SCSI:aborted command" sSenseCode="" iSenseKey="0xb" iASC="0x2f" iASCQ="0x10" iFRU="0x0" DTime="4353"
09 Dec 2020 23:52:02 [node1: Debug] pmcsas_admin_0 scsi cmd retrySuccess: deviceType="Disk" deviceName="0d.02.10" retryCount="0" freeRetryCount="1" cdb="0x28:25fe9428:0008" dTime="4444"
09 Dec 2020 23:52:02 [node1: Debug] pmcsas_admin_0 scsi cmd retrySuccess: deviceType="Disk" deviceName="0d.02.10" retryCount="0" freeRetryCount="1" cdb="0x28:288a2930:0008" dTime="4375"
09 Dec 2020 23:52:02 [node1: Debug] pmcsas_admin_0 scsi cmd retrySuccess: deviceType="Disk" deviceName="0d.02.10" retryCount="0" freeRetryCount="1" cdb="0x28:288a25e8:0008" dTime="4394"
09 Dec 2020 23:52:08 [node1: Debug] pmcsas_timeout_0 sas device quiesce: adapterName="0a" deviceType="disk" deviceName="0d.02.10"
09 Dec 2020 23:52:12 [node1: Error] pmcsas_timeout_0 sas device timeout: adapterName="0a" deviceType="Disk" deviceName="0d.02.10"
09 Dec 2020 23:52:12 [node1: Info] pmcsas_timeout_0 sas adapter debug: adapterName="0a" debug_string="Level 0 timeout: Abort task set: 0d.02.10 L0 (0xfffff81e6dbd6818,0x2f:1aa49c00:0400,0/0)"
09 Dec 2020 23:52:12 [node1: Info] pmcsas_timeout_0 sas adapter debug: adapterName="0a" debug_string="ABORT TASKSET on device 0d.02.10 L0"
09 Dec 2020 23:52:18 [node1: Debug] pmcsas_timeout_0 sas adapter exception: error="Failed to get a response in state 73" adapterName="0a"
09 Dec 2020 23:52:18 [node1: Info] pmcsas_asyncd_0 sas adapter debug: adapterName="0a" debug_string="ABORT TASKSET of device 0d.02.10 L0 failed (retries exhausted) or timed out -- scheduling HARD LINK RESET"
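To check a saved EMS log for the same pattern, the sequence above can be searched for with a small script. The following is a minimal sketch, not a NetApp tool: it assumes the log has been exported as text in exactly the line format shown in the example, and the names parse_line, find_timeout_then_reset, and window_seconds are hypothetical, chosen for illustration only. It pairs each "sas device timeout" event with a later "Failed to get a response in state" adapter exception on the same adapter.

```python
import re
from datetime import datetime

# Line layout assumed from the example above:
#   DD Mon YYYY HH:MM:SS [node: Severity] source event text: key="value" ...
LINE_RE = re.compile(
    r'^(\d{2} \w{3} \d{4} \d{2}:\d{2}:\d{2}) \[([^:\]]+): (\w+)\] (\S+) (.*)$'
)
KV_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_line(line):
    """Parse one EMS line into a dict, or return None if it does not match."""
    m = LINE_RE.match(line.strip())
    if not m:
        return None
    ts, node, severity, source, rest = m.groups()
    return {
        # %b expects English month abbreviations (default C locale).
        "time": datetime.strptime(ts, "%d %b %Y %H:%M:%S"),
        "node": node,
        "severity": severity,
        "source": source,
        "message": rest,
        "fields": dict(KV_RE.findall(rest)),
    }


def find_timeout_then_reset(lines, window_seconds=60):
    """Return (timeout_event, exception_event) pairs on the same adapter.

    window_seconds is an arbitrary illustrative limit on how far apart
    the two events may be; it is not a value taken from ONTAP.
    """
    pairs = []
    pending = {}  # adapterName -> most recent device timeout event
    for line in lines:
        ev = parse_line(line)
        if ev is None:
            continue
        adapter = ev["fields"].get("adapterName")
        if adapter is None:
            continue
        if "sas device timeout:" in ev["message"]:
            pending[adapter] = ev
        elif "sas adapter exception:" in ev["message"] and ev["fields"].get(
            "error", ""
        ).startswith("Failed to get a response in state"):
            prior = pending.get(adapter)
            if prior is not None:
                delta = (ev["time"] - prior["time"]).total_seconds()
                if 0 <= delta <= window_seconds:
                    pairs.append((prior, ev))
    return pairs
```

Passing the lines of an exported log file to find_timeout_then_reset returns one pair per match; the "fields" entry of the first element of each pair carries the adapterName and deviceName of the disk that timed out.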