Random iSCSI disconnects due to custom protection domain rebalancing conflict
Applies to
- NetApp SolidFire Element software 12.0
- Clusters with Custom Protection Domains configured
Issue
- Random iSCSI disconnects occur on hosts connected to a SolidFire cluster
- Kubernetes pods fail, and PVCs become unavailable or read-only; filesystems are left in a broken state
- SolidFire event log shows alternating events every ~15 minutes:
Slice Reassignment: Balancing volumes for zone tolerance
Slice Reassignment: Balancing volumes for performance
- No errors in SolidFire node logs
- Packet trace shows:
SCSI Check Condition with Sense Key Not Ready (0x02) and ASC/ASCQ 0x04/0x01 (Logical Unit Is In Process Of Becoming Ready), returned in response to Test Unit Ready commands
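To confirm that a captured response matches the packet-trace signature above, the sense data can be decoded by hand. The following is a minimal sketch, assuming the raw sense bytes have already been extracted from the capture; the names `Sense`, `parse_sense`, and `is_becoming_ready` are hypothetical helpers, not part of any NetApp or capture tool. The byte offsets follow the standard SCSI fixed-format (response codes 0x70/0x71) and descriptor-format (0x72/0x73) sense layouts.

```python
from typing import NamedTuple


class Sense(NamedTuple):
    """Decoded SCSI sense triple (hypothetical helper type)."""
    key: int
    asc: int
    ascq: int


def parse_sense(buf: bytes) -> Sense:
    """Extract sense key, ASC, and ASCQ from raw SCSI sense data."""
    if not buf:
        raise ValueError("empty sense buffer")
    code = buf[0] & 0x7F  # mask off the VALID bit
    if code in (0x70, 0x71):  # fixed format
        if len(buf) < 14:
            raise ValueError("fixed-format sense data too short")
        return Sense(buf[2] & 0x0F, buf[12], buf[13])
    if code in (0x72, 0x73):  # descriptor format
        if len(buf) < 4:
            raise ValueError("descriptor-format sense data too short")
        return Sense(buf[1] & 0x0F, buf[2], buf[3])
    raise ValueError(f"unknown sense response code 0x{code:02x}")


def is_becoming_ready(sense: Sense) -> bool:
    """True for Not Ready (0x02) with ASC/ASCQ 0x04/0x01."""
    return sense == Sense(0x02, 0x04, 0x01)


# Illustrative fixed-format buffer, constructed by hand (not taken from a real trace).
sample = bytes([0x70, 0, 0x02, 0, 0, 0, 0, 0x0A, 0, 0, 0, 0, 0x04, 0x01, 0, 0, 0, 0])
print(is_becoming_ready(parse_sense(sample)))
```

A match on this triple only identifies the symptom; it does not by itself establish the cause.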
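The alternating event pattern listed above can also be checked programmatically once the event log has been exported. This is a rough sketch under stated assumptions: the events are available as `(timestamp, message)` tuples (a hypothetical export shape, not an Element API format), and the 20-minute window is an arbitrary threshold chosen to bracket the roughly 15-minute interval. The function name `count_alternations` is illustrative.

```python
from datetime import datetime, timedelta

# Message fragments as they appear in the SolidFire event log.
ZONE = "Balancing volumes for zone tolerance"
PERF = "Balancing volumes for performance"


def count_alternations(events, max_gap=timedelta(minutes=20)):
    """Count consecutive rebalancing events that switch reason within max_gap.

    events: iterable of (datetime, message) tuples (assumed export shape).
    """
    relevant = sorted(
        (ts, ZONE if ZONE in msg else PERF)
        for ts, msg in events
        if ZONE in msg or PERF in msg
    )
    flips = 0
    for (t0, kind0), (t1, kind1) in zip(relevant, relevant[1:]):
        if kind0 != kind1 and t1 - t0 <= max_gap:
            flips += 1
    return flips


# Synthetic events for illustration only.
start = datetime(2024, 1, 1, 0, 0)
demo = [
    (start + i * timedelta(minutes=15),
     "Slice Reassignment: " + (ZONE if i % 2 == 0 else PERF))
    for i in range(4)
]
print(count_alternations(demo))
```

A steadily growing count over several hours suggests the two rebalancing reasons are repeatedly undoing each other rather than converging.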
