VASA Provider: VMware Snapshot operations timeout with "Reached maximum split load allowed on the node" error
Applies to
- ONTAP Tools for VMware vSphere 9.13 and earlier
- VASA Provider
- VMware vSphere vVol (virtual volumes)
Issue
VMware snapshot operations fail when:
- The VM (which resides on a vVol datastore) has a large number of fairly large disks (20+ 2TB disks)
- There is evidence that the volume file clone split load "Allowable Split Load" is exceeded
When troubleshooting the issue, you see the following behavior:
- In VASA Provider's (VP) vvolvp.log, the following '
clone-token-create
' API calls fail during VP's 'prepareToSnapshotVirtualVolume
' task with the following reason:
[cxf-12345:prepareToSnapshotVirtualVolume-123] NaException com.netapp.offtap3.ontap.ApiFailedException: Failed to create token for node "node01". Reason: Reached maximum split load allowed on the node. (errno=13001)
- During the snapshot operation, you see a high 'Token Reserved Load' and the 'Allowable Split Load' pool is exhausted:
ss1ref::> volume file clone split load show
Node Max Current Token Allowable
Split Load Split Load Reserved Load Split Load
------------------------------- ---------- ---------- ------------- ----------
node-01 470TB 0B 466.9TB 3.06TB
node-02 470TB 0B 378TB 92.00TB