How to troubleshoot StorageGRID unable to communicate with node (NDDOWN)
Applies to
- NetApp StorageGRID
- NetApp StorageGRID Appliances
- NetApp StorageGRID VMware based nodes
- NetApp StorageGRID Bare-metal based nodes
Description
This article helps identify why a StorageGRID node is down. A StorageGRID node can enter a disconnected state for several reasons, such as:
- Maintenance operations
- Network connectivity issues
- Hardware issues
- File system corruption
- Resource issues (such as CPU, memory, or disk). A common symptom of resource issues is sluggish or slow responses in the system's shell, CLI, or SSH sessions.
- Decommission
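If the node still accepts a shell session, a quick resource snapshot can help separate resource pressure from the other causes in the list above. The following is a minimal sketch, not a StorageGRID tool: the function names `resource_snapshot` and `looks_constrained`, and the thresholds `load_limit` and `disk_limit`, are assumptions chosen for illustration. It uses only the Python standard library, works on Unix-like systems, and does not cover memory.

```python
import os
import shutil


def resource_snapshot(path="/"):
    """Collect a rough CPU-load and disk-usage snapshot (Unix-like systems only)."""
    load1, _, _ = os.getloadavg()      # 1-minute load average
    cpus = os.cpu_count() or 1         # fall back to 1 if the count is unknown
    usage = shutil.disk_usage(path)    # total, used, free in bytes
    return {
        "load_per_cpu": load1 / cpus,
        "disk_used_fraction": usage.used / usage.total,
    }


def looks_constrained(snapshot, load_limit=1.0, disk_limit=0.9):
    """Flag a snapshot that exceeds either illustrative threshold.

    The default limits are assumptions, not StorageGRID guidance.
    """
    return (
        snapshot["load_per_cpu"] > load_limit
        or snapshot["disk_used_fraction"] > disk_limit
    )


if __name__ == "__main__":
    snap = resource_snapshot("/")
    print(snap, "constrained:", looks_constrained(snap))
```

A flagged snapshot only suggests where to look next; it does not by itself confirm that resource exhaustion caused the node to disconnect.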
StorageGRID may report the following errors:
- StorageGRID node(s) appear next to a blue icon (Disconnected – Unknown) in the Grid Manager
- StorageGRID reports the Unable to communicate with Node alert
- StorageGRID reports the Unexpected node reboot alert if the node came back online on its own
- StorageGRID reports the NDDOWN legacy alarm
- If AutoSupport is enabled, it will open an NDDOWN AutoSupport incident: CSTARS:StorageGRID Notification from <serial number> (NODE_DOWN-CRITICAL) ERROR
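When these incidents arrive in a ticket queue or mailbox, it can be useful to recognize the title automatically. The sketch below assumes the incident title has exactly the shape shown above, starting at `CSTARS:`, and that the serial number contains no spaces. The pattern, the function name `parse_incident_title`, and the group names are illustrative assumptions, and the serial number in the usage example is a made-up placeholder.

```python
import re

# Pattern derived from the incident title quoted in this article.
# Assumption: the serial number is a single token without whitespace.
INCIDENT_TITLE_RE = re.compile(
    r"^CSTARS:StorageGRID Notification from (?P<serial>\S+) "
    r"\((?P<event>[A-Z_]+)-(?P<severity>[A-Z]+)\) (?P<level>[A-Z]+)$"
)


def parse_incident_title(title):
    """Return the title's fields as a dict, or None if it does not match."""
    match = INCIDENT_TITLE_RE.match(title.strip())
    return match.groupdict() if match else None


if __name__ == "__main__":
    # "123456" is a hypothetical serial number used only for illustration.
    example = "CSTARS:StorageGRID Notification from 123456 (NODE_DOWN-CRITICAL) ERROR"
    print(parse_incident_title(example))
```

Titles that do not match return `None`, so a caller can fall back to manual triage rather than guess at the fields.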