Unable to communicate with node due to base OS file system corruption
Applies to
- NetApp StorageGRID 11.4
- Container-based StorageGRID node
Issue
- Alert on an Admin Node and a Storage Node from Grid Manager, Alert > Current:
Unable to communicate with node
One or more services are unresponsive, or the node cannot be reached
- Both nodes are running on a same base OS
- Unable to collect logs from both nodes. Connection timeout error seen in
info.txt
file in the log tar.gz file:
{
"Node ID": "idididid-idid-idid-idid-idididididid",
"Hostname": "dc2-adm1",
"Error": "Unable to connect to dc2-adm1 on port 22: Net::SSH::ConnectionTimeout"
},
{
"Node ID": "idididid-idid-idid-idid-idididididid",
"Hostname": "dc2-sn1",
"Error": "Unable to connect to dc2-sn1 on port 22: Net::SSH::ConnectionTimeout"
},