Lost access to volume on ESXi host due to faulty HBA
Applies to
- FC LUN
- Brocade switch
- ONTAP 9.x
Issue
- The FC LUN became inaccessible from the host due to connectivity issues
- IO WQE failure errors were observed on the storage side, on both nodes, as shown below:
Mon Aug 21 23:18:43 +0700 [NETxx-02: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:5a IO WQE failure, Handle 0x4, Type 8, S_ID: 49xx00, VPI: 3, OX_ID: 892, Status 0x3 Ext_Status 0x16
Mon Aug 21 23:19:34 +0700 [NETxx-02: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:1a IO WQE failure, Handle 0x0, Type 8, S_ID: 49xx00, VPI: 3, OX_ID: 887, Status 0x3 Ext_Status 0x16
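When the EMS log contains many of these events, it can help to tally them per node, adapter, and initiator S_ID to see whether one path dominates. The following is a minimal Python sketch, not a NetApp tool: the regular expression is modeled only on the two sample lines above, and the names `WQE_RE` and `count_wqe_failures` are hypothetical. Other ONTAP releases may format the event differently.

```python
import re
from collections import Counter

# Pattern derived from the sample fcp.io.status lines in this article (an assumption,
# not an official format specification).
WQE_RE = re.compile(
    r"\[(?P<node>[^:\]]+):.*?fcp\.io\.status:\w+\]:\s+"
    r"STIO Adapter:(?P<adapter>\w+) IO WQE failure,"
    r".*?S_ID: (?P<s_id>\w+),"
    r".*?Status (?P<status>0x[0-9a-fA-F]+) Ext_Status (?P<ext_status>0x[0-9a-fA-F]+)"
)

def count_wqe_failures(lines):
    """Count IO WQE failure events per (node, adapter, S_ID)."""
    counts = Counter()
    for line in lines:
        m = WQE_RE.search(line)
        if m:
            counts[(m["node"], m["adapter"], m["s_id"])] += 1
    return counts
```

Calling `count_wqe_failures(open("ems.log"))` (the file name is a placeholder) returns a `Counter` keyed by node, target adapter, and source ID.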
- C3 timeout Tx errors were observed on the ISL ports of the Brocade switch:
/fabos/cliexec/porterrshow :
       frames        enc    crc    crc    too    too    bad    enc   disc   link   loss   loss   frjt   fbsy   c3timeout      pcs  uncor
        tx     rx     in    err  g_eof   shrt   long    eof    out     c3   fail   sync    sig                     tx     rx    err    err
46:   2.1t 558.7g      0      0      0      0      0      0      0 159.6k      5      0     11      0      0 159.5k      0      0 427.4k
47:   2.1t 558.7g      0      0      0      0      0      0      0 153.9k      6      0     11      0      0 153.9k      0      0   2.0k
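To compare the abbreviated counters across ports, the rows can be converted to plain integers. The sketch below is illustrative only: it assumes the suffixes k, m, g, and t are decimal multipliers (thousand, million, billion, trillion), and it assumes the 19-column order read from the `porterrshow` header in this article. The names `COLUMNS`, `parse_counter`, and `parse_row` are hypothetical.

```python
# Assumed decimal multipliers for the abbreviated porterrshow counters.
SUFFIX = {"k": 10**3, "m": 10**6, "g": 10**9, "t": 10**12}

# Column order as read from the porterrshow header shown in this article (assumption).
COLUMNS = [
    "frames_tx", "frames_rx", "enc_in", "crc_err", "crc_g_eof", "too_shrt",
    "too_long", "bad_eof", "enc_out", "disc_c3", "link_fail", "loss_sync",
    "loss_sig", "frjt", "fbsy", "c3timeout_tx", "c3timeout_rx", "pcs_err",
    "uncor_err",
]

def parse_counter(token):
    """Convert a counter such as '159.5k' or '11' to an integer."""
    token = token.strip().lower()
    if token and token[-1] in SUFFIX:
        return round(float(token[:-1]) * SUFFIX[token[-1]])
    return int(token)

def parse_row(line):
    """Split a row such as '46: 2.1t 558.7g 0 ...' into (port, {column: value})."""
    port, _, rest = line.partition(":")
    values = [parse_counter(tok) for tok in rest.split()]
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} counters, got {len(values)}")
    return int(port), dict(zip(COLUMNS, values))
```

With the rows above, `parse_row` maps port 46 to a `c3timeout_tx` value of 159500, which nearly matches its `disc_c3` value of 159600. That is consistent with almost all class 3 discards on this port being transmit timeouts.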
- Frame timeout detected errors were reported in errdump, indicating that frames could not be sent to the ISL port:
2023/06/08-11:39:48, [AN-1014], 184556, FID 128, INFO, ssw-bjb-farm-drc-01a, Frame timeout detected, tx port 46 rx port 0, sid 10001, did 490400, timestamp 2023-06-08 11:39:48 .
2023/06/08-11:39:47, [AN-1014], 184536, FID 128, INFO, ssw-bjb-farm-drc-01a, Frame timeout detected, tx port 46 rx port 0, sid 10001, did 490400, timestamp 2023-06-08 11:39:47 .
2023/06/08-11:41:34, [AN-1014], 184637, FID 128, INFO, ssw-bjb-farm-drc-01a, Frame timeout detected, tx port 47 rx port 0, sid 10001, did 490400, timestamp 2023-06-08 11:41:34
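The AN-1014 messages above can be summarized per transmit port to confirm that the timeouts are concentrated on the ISL ports. This is a minimal Python sketch under the assumption that every message follows the layout of the three sample lines; `AN1014_RE` and `timeouts_per_tx_port` are hypothetical names, not part of Fabric OS.

```python
import re
from collections import Counter

# Pattern modeled on the AN-1014 sample lines in this article (an assumption).
AN1014_RE = re.compile(
    r"^(?P<ts>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}), \[AN-1014\],.*?"
    r"Frame timeout detected, tx port (?P<tx>-?\d+) rx port (?P<rx>-?\d+), "
    r"sid (?P<sid>[0-9a-fA-F]+), did (?P<did>[0-9a-fA-F]+)"
)

def timeouts_per_tx_port(lines):
    """Count 'Frame timeout detected' events per transmit port."""
    counts = Counter()
    for line in lines:
        m = AN1014_RE.search(line)
        if m:
            counts[int(m["tx"])] += 1
    return counts
```

For the three messages shown, the function reports two events on tx port 46 and one on tx port 47, which are the same ports that show C3 timeout Tx counters in `porterrshow`.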