Disks in VMware on Windows disconnects and back ups failing to E-Series
Applies to
- NetApp E-Series
- VMware ESXi
- Cisco network switches
Issue
- VMware ESXi host experiencing transient communication issues with E-Series LUNs causing device target resets and back ups to fail.
- E-Series ASUP reports high count of fibre channel link errors on both controllers in
state-capute-data.txt
>fcDump
Executing fcDump(0,0,0,0,0,0,0,0,0,0) on controller A:
fcAll (Tick 3126573628) ==> 04/03/25-09:48:25
5700-A Our Num ::...Exchange Counts...:: Num ..Link Up..
Chip LinkStat Port Port :: :: Link Bad Bad
ID Logi ::Open Total Errors:: Down Char Frame
3-Src Up-Fab 5x0000 7 :: 1 44802101 5924:: 0 0 0
4-Src Up-Fab 9x00x0 7 :: 0 44772041 6616:: 0 0 4
- VMware logs reports FC layer aborts without any underlying or associated SCSI errors:
2025-03-12T23:13:25.531Z cpu22:2097812)nfnic: <1>: INFO: fnic_abort_cmd: 3805: Abort cmd called for Tag: 0x1 issued time: 60438 ms CMD_STATE: FNIC_IOREQ_CMD_PENDING CDB Opcode: 0x2a sc:0x45d97ac00688 flags: 0x3 lun: 3 target: 0x5c0000
2025-03-12T23:13:25.531Z cpu22:2097812)WARNING: nfnic: <1>: fnic_abort_cmd: 3819: Abort for cmd tag: 0x1 in pending state
2025-03-12T23:13:25.531Z cpu22:2097812)nfnic: <1>: INFO: fnic_abort_cmd: 3805: Abort cmd called for Tag: 0x7ef issued time: 60438 ms CMD_STATE: FNIC_IOREQ_CMD_PENDING CDB Opcode: 0x2a sc:0x45d941b46008 flags: 0x3 lun: 3 target: 0x5c0000
- VMware kernel logs report
0x5
and0x7
status errors:
2025-03-12T23:32:52.456Z cpu105:2098331)NMP: nmp_ThrottleLogForDevice:3867: Cmd 0x88 (0x45d979778a88, 6016532) to dev "naa.6d039ea000ae04ea000001d566c472f5" on path "vmhba1:C0:T7699:L3" Failed:
2025-03-12T23:32:52.456Z cpu105:2098331)ScsiDeviceIO: 4176: Cmd(0x45d979778a88) 0x88, CmdSN 0x8000004a from world 6016532 to dev "naa.6d039ea000ae04ea000001d566c472f5" failed H:0x7 D:0x0 P:0x0
2025-03-12T23:33:53.427Z cpu95:2098331)NMP: nmp_ThrottleLogForDevice:3867: Cmd 0x88 (0x45d9797ca388, 6016532) to dev "naa.6d039ea000ae04ea000001d566c472f5" on path "vmhba1:C0:T7699:L3" Failed:
2025-03-12T23:33:53.427Z cpu95:2098331)ScsiDeviceIO: 4124: Cmd(0x45d9797ca388) 0x88, CmdSN 0x80000007 from world 6016532 to dev "naa.6d039ea000ae04ea000001d566c472f5" failed H:0x5 D:0x0 P:0x0