LUNs online but not accessible after node unjoin from the cluster
Applies to
- ONTAP 9
- ESXi
- SAN
- FCP
Issue
- The issue is introduced by removing or otherwise manipulating cluster nodes on affected ONTAP releases, and it is not necessarily obvious right away
- High CPU usage at node level due to ssan_exempt
- SAN hosts lose access when path failover is carried out on an impacted LIF
- Scenarios that may expose the issue include an HA takeover caused by a power failure, an over-temperature condition, or an update. After such an event, the host server cannot access the FC (Fibre Channel) target correctly.
2021-07-29T09:18:37.637Z cpu7:2097711)VMW_SATP_ALUA: satp_alua_issueCommandOnPath:735: Path "vmhba2:C0:T7:L51" (UP) command 0xa3 failed with status Timeout. H:0x5 D:0x0 P:0x0 Invalid sense data: 0x0 0x0 0x0.
2021-07-29T09:18:47.637Z cpu7:2097711)VMW_SATP_ALUA: satp_alua_issueCommandOnPath:735: Path "vmhba1:C0:T7:L51" (UP) command 0xa3 failed with status Timeout. H:0x5 D:0x0 P:0x0 Invalid sense data: 0x0 0x0 0x0.
2021-07-29T09:18:57.637Z cpu7:2097711)VMW_SATP_ALUA: satp_alua_issueCommandOnPath:735: Path "vmhba2:C0:T6:L51" (UP) command 0xa3 failed with status Timeout. H:0x5 D:0x0 P:0x0 Invalid sense data: 0x0 0x0 0x0.
- The network interface show command shows a UUID (not the node name) for the nodes that were removed from the cluster:
::*> network interface show
            Logical    Status     Network                 Current                              Current Is
Vserver     Interface  Admin/Oper Address/Mask            Node                                 Port    Home
----------- ---------- ---------- ----------------------- ------------------------------------ ------- ----
FC_vserver
            LIF_1      up/up      20:0b:00:a0:98:xx:xx:xx Node_1                               0c      true
            LIF_2      up/up      20:07:00:a0:98:xx:xx:xx Node_2                               0d      true
            LIF_3      up/-       20:0c:00:a0:98:xx:xx:xx 507049c5-fd62-11e5-9977-3bcadcfd9d22 0c      true
            LIF_4      up/-       20:08:00:a0:98:xx:xx:xx 507049c5-fd62-11e5-9977-3bcadcfd9d22 0d      true
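The check described above (a UUID in the Current Node column instead of a node name) can be sketched as a small script run against saved command output. This is a minimal illustration, not a NetApp tool: the function name find_lifs_with_uuid_node and the SAMPLE text are hypothetical, and the parser assumes each LIF row sits on a single line with six whitespace-separated fields. Real ONTAP output may wrap long rows, in which case the rows would need to be joined first.

```python
import re
from typing import List, Tuple

# Canonical 8-4-4-4-12 hexadecimal UUID, as seen in the Current Node column.
UUID_RE = re.compile(
    r"^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$",
    re.IGNORECASE,
)

# Hypothetical capture of "network interface show" output, copied from the
# example in this article. Assumption: one LIF per line, no wrapped rows.
SAMPLE = """\
Logical Status Network Current Current Is
Vserver Interface Admin/Oper Address/Mask Node Port Home
----------- ---------- ---------- ------------------ ------------- ------- ----
FC_vserver
LIF_1 up/up 20:0b:00:a0:98:xx:xx:xx Node_1 0c true
LIF_2 up/up 20:07:00:a0:98:xx:xx:xx Node_2 0d true
LIF_3 up/- 20:0c:00:a0:98:xx:xx:xx 507049c5-fd62-11e5-9977-3bcadcfd9d22 0c true
LIF_4 up/- 20:08:00:a0:98:xx:xx:xx 507049c5-fd62-11e5-9977-3bcadcfd9d22 0d true
"""


def find_lifs_with_uuid_node(output: str) -> List[Tuple[str, str]]:
    """Return (lif_name, node_uuid) for rows whose Current Node is a UUID."""
    flagged = []
    for line in output.splitlines():
        fields = line.split()
        # A LIF row has six fields: name, status, address, node, port, is-home.
        # Header, separator, and Vserver-name lines either have a different
        # field count or no UUID in the node position, so they are skipped.
        if len(fields) != 6:
            continue
        name, _status, _address, node, _port, _is_home = fields
        if UUID_RE.match(node):
            flagged.append((name, node))
    return flagged


if __name__ == "__main__":
    for lif, uuid in find_lifs_with_uuid_node(SAMPLE):
        print(f"{lif}: current node is UUID {uuid}")
```

Matching on the UUID shape rather than on the up/- status keeps the check tied to the symptom this article describes, since a LIF can be operationally down for unrelated reasons.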