StorageGRID Unable to Communicate with node due to unresponsive Jaeger service
Applies to
NetApp StorageGRID Admin/Gateway nodes
Issue
The Jaeger-agent or Jaeger-collector becomes unresponsive for a short period causing the Unable to Communicate with node alert to trigger.
nginx/error.log
2024-11-03_2245-2345/nginx/error.log:2024/11/03 23:38:53 [error] 1812680#1812680: *40277633 upstream prematurely closed connection while reading response header from upstream, client: X.X.X.X, server: _, request: "GET /metrics/jaegercollector HTTP/2.0", upstream: "http://127.0.0.1:14269/metrics", host: "NODE_NAME:9999"