VMware ESXi host has FC disconnections and missing paths
Applies to
- ONTAP
- VMware ESXi
Issue
- All SAN initiators report Partially Connected
- In system Manager Single FC port from each node in 2 node HA pair has 0 MB/s throughput
- In fibre channel environment hundreds of Host logins (PLOGI) across all storage nodes seen in EMS logs
- VMware hosts experience disconnections, all paths down (APD), permanent device loss (PDL)
- All connection issues proceeded from a fabric registered state change notification (RSCN)
- Missing/failed paths for one Storage Target HBA port once system is stable
esxcfg-mpath -b
vmhba2:C0:T6:L1 LUN:1 state:dead fc Adapter: Unavailable Target: Unavailable
vmhba2:C0:T3:L1 LUN:1 state:dead fc Adapter: Unavailable Target: Unavailable
less /var/run/log/vmkernel.log
lpfc_els_rcv_rscn:5945: 0:(0):0214 RSCN received Data:
lpfc_els_rcv_rscn:5955: 0:(0):5973 RSCN received event x0 : Address format x03 : DID x000000
lpfc_handle_status:5079: 0:(0):3271: FCP cmd x8a failed <1/6> sid , did , oxid x7a iotag x417 Invalid RPI Host Retry
NMP: nmp_ThrottleLogForDevice:3861: Cmd 0x8a (,) to dev "naa.xyz" on path "vmhba1:C0:Tx:L1" Failed:
NetApp:
bcomd: san.initiator.login.missing:notice]: Initiator "xyx" is not logged in to any LIFs on this node for Vserver "svm123", portset "". Access to LUNs might be disrupted if a failover occurs.