ONTAP SAN FC Windows what are SRB timeouts and how to troubleshoot them?
Applies to
- ONTAP 9.x
- ONTAP DSM all versions
- Microsoft Windows Server 2012 and above
Answer
This is a fairly generic error that basically tells us connectivity has been disrupted between the host and the storage. This can be caused by any number of reasons including the following:
- SAN Fabric connectivity and hardware component health such as SFPs, cables, HBAs, switches, etc....
- Storage performance
- Host MPIO and interoperability (HBA, host OS, ONTAP release, etc...)
- MPIO performance and load balancing
Additional Information
Error example in windows event log:
- IO error reported on LUN x on Path Id 0x00000x. The IO will be retried.
- IO error: SRB Status Command timeout reported on LUN x on Path Id 0x000x01. The IO will be retried.
- 61123 ontapdsm IO error: SRB Status Timeout reported on LUN x on Path Id 0x000101. The IO will be retried.
- 61125 ontapdsm IO error: SRB Status Command timeout reported on LUN x on Path Id 0x000901.The IO will be retried.
- 61123:The port servicing the specified LUN and path ID (I_T nexus) reported that an I/O operation timed out.
- 61125:The port servicing the specified LUN and path ID (I_T nexus) reported that an I/O command Knowledge base timed out. The I/O request will be retried.