ESXi host reports APD issue on ONTAP NFS datastore
Applies to
- ONTAP Version: 9.15.1P7
- VMware ESXi Version: 8.0.3
Issue
One ESXi host is unable to access the ONTAP NFS volume datastore, affecting all virtual machines on that host. Other ESXi hosts on the same network, connected to the same datastore, are not experiencing any issue
ESXi log:
[vob.storage.apd.start] Device or filesystem with identifier [xxxxxx-xxxxxxx] has entered the All Paths Down state
WARNING: NFS: xxx: Lost connection to the server xxxx mount point /xxxx, mounted as xxxxxx-xxxxxx-0000-000000000000 ("xxxxxxxx")
ONTAP log:
kernel: Nblade.nfsConnResetAndClose:error]: Shutting down connection with the client. Vserver ID is xx; network data protocol is NFS, Rpc Xid xxxxx; client IP address:port is x.x.x.x:xxxx. local IP address is x.x.x.x; reason is CSM error - Maximum number of rewind attempts has been exceeded
ERROR Nblade.CallbackTimedOut: SM NOTIFY: Vserver xx, Vif xxx: PORTMAP program (Program number:100000 Program version:2) on client x.x.x.x is not responding
Troubleshoot the Issue:
- Check if other ESXi hosts connected to the same ONTAP NAS datastore are experiencing the issue
- Confirm that other ESXi hosts connected to the reported ONTAP NAS datastore are not seeing the issue
- Verify that you can ping and traceroute to the client IP from the ONTAP LIF
- Review packet traces collected from the ONTAP NAS vserver LIF. Ensure they show packets from other ESXi hosts connected to the reported ONTAP NAS datastore, but not from the affected ESXi host