StorageGrid Replicated State Machine "RSM" stuck in starting state
Applies to
- StorageGRID WebScale 11.2.x and later
Issue
- Upgrade hangs while starting services for the first 3 nodes a single site
- Output of 'storagegrid-status' shows RSM in a starting state
- File called 'expansion-started' exists within '/var/local/rsm' directory
- Example error meebycast-err.log:
- rsm[11204]: [raft.go:135] CRITICAL: Failed to join cluster
- servermanager.log:
- rsm | RSM is not ready because there is no cluster or the cluster has no leader