Skip to main content
NetApp Knowledge Base

Why StorageGRID Unable to Communicate with Node alert did not trigger

Views:
Visibility:
Public
Votes:
0
Category:
storagegrid-webscale
Specialty:
HW
Last Updated:

Applies to

NetApp StorageGRID

Answer

The Unable to communicate with node alert looks for the following attributes from Prometheus.
  • (count(up != 1) without (job) > 0) - Checks if a node is unreachable or offline

  • unless on(instance) storagegrid_administratively_down == 1 - Checks if the node is administratively down

The administratively down attribute determines if the node being down is expected. The attribute is set during the following exceptions.
  • Placed into maintenance mode
  • Being decommissioned
  • Undergoing a clone operation
  • Powered offline from the BMC or physically
  • Shut down from CLI
If the administratively down attribute is true, the alert will not trigger.

Additional Information

additionalInformation_text
NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.