Multiple StorageGRID storage nodes reboot in a short timeframe
Applies to
- NetApp StorageGRID earlier than 11.6.0.12
- NetApp StorageGRID earlier than 11.7.0.5
Issue
- Grid Manager reports unresponsive services on a number of storage nodes.
- Example:
Platform Services Unavailable alert ("too few storage nodes with the RSM service running or available at a site")
- Multiple storage nodes reboot unexpectedly and report:
Unexpected node reboot
and/or Unable to communicate with Node
- Under Support > Metrics > Prometheus, the query
storagegrid_private_client_connections
shows a sharp spike in the connection count, up to a few thousand.
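To make "sharp spike" concrete, the following is a minimal sketch of how readings from the storagegrid_private_client_connections graph could be checked offline. It assumes the samples for one node have been copied out by hand as (timestamp, value) pairs. The function name find_spikes, the factor of 5, and the floor of 1000 connections are illustrative assumptions, not NetApp-defined thresholds, and the readings are made-up values.

```python
from statistics import median


def find_spikes(samples, factor=5.0, floor=1000.0):
    """Return the (timestamp, value) samples that look like a spike.

    samples: list of (timestamp, value) pairs for a single node, for
    example copied from the storagegrid_private_client_connections graph.
    factor and floor are illustrative assumptions, not NetApp thresholds.
    """
    if not samples:
        return []
    # Use the median as the node's typical connection count so that the
    # spike itself does not inflate the baseline.
    baseline = median(value for _, value in samples)
    # A sample counts as a spike when it is well above the typical level
    # and also at least `floor` connections in absolute terms.
    threshold = max(baseline * factor, floor)
    return [(ts, value) for ts, value in samples if value >= threshold]


# Hypothetical readings taken one minute apart (not real grid data).
readings = [(0, 40), (60, 42), (120, 38), (180, 2900), (240, 3100), (300, 45)]
print(find_spikes(readings))
```

With these hypothetical readings, only the two samples in the thousands are returned. The absolute floor keeps a small jump on an otherwise idle node from being flagged.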