NetApp Knowledge Base

All collectors run into Error status with Error Code AGENT008 at the same time

Category: data-infrastructure-insights
Specialty: oci

Applies to

  • Data Infrastructure Insights (DII) (formerly Cloud Insights)
  • Storage Workload Security

Issue

  • Subject: Critical Health Alert: Storage Workload Security Data Collector '<Collector Name>' is disconnected

Description: SVM Data Collector '<Collector Name>' is disconnected. The SVM is not monitored and protected.
Error: Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008)

  • Subject: Warning Health Alert: Storage Workload Security User Directory Collector '<Collector Name>' is disconnected

Description: User Directory Collector '<Collector Name>' is disconnected. Users' information is not updated. 
Error: Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008) 

  • All collectors listed on the following pages are in Error status with the message shown below:
    • Workload Security > Collectors > Data Collectors
    • Workload Security > Collectors > User Directory Collectors

Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008)


  • agent.log shows that the agent fails to get the state of each collector with a certificate_unknown error and then removes the collector from monitoring:

[ERROR] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - Failed to get state of <DATASOURCE_UUID>, reason: NotAfter: <TIMESTAMP>
..
[ERROR] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - Failed to get state of <DATASOURCE_UUID>, reason: Received fatal alert: certificate_unknown
..
[INFO] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - Removed collector: <DATASOURCE_UUID> from monitoring
..
[INFO] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - All collector health status has been updated- stateMap: [Map(<DATASOURCE_UUID> -> error)], statusMap: [Map(<DATASOURCE_UUID> -> Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008))]
..
[WARN] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceJvm] - Skipped Refresh Jwt as the datasource <DATASOURCE_UUID> is not running
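
To confirm that every collector failed for the same reason, the log excerpt above can be summarized per collector. The following is a minimal sketch, not a NetApp tool: the function name summarize_agent_log and the regular expressions are assumptions derived only from the message formats shown in the excerpt, and the location of agent.log on the agent host is not stated in this article.

```python
import re
from collections import defaultdict

# Patterns derived from the log lines quoted in this article (assumed format).
FAILED_STATE = re.compile(r"Failed to get state of (?P<ds>\S+), reason: (?P<reason>.+)$")
REMOVED = re.compile(r"Removed collector: (?P<ds>\S+) from monitoring")


def summarize_agent_log(lines):
    """Group failure reasons and removal events by collector (datasource) UUID.

    `lines` is any iterable of log lines, for example an open file handle.
    Returns {uuid: {"reasons": [...], "removed": bool}}.
    """
    summary = defaultdict(lambda: {"reasons": [], "removed": False})
    for line in lines:
        line = line.rstrip()
        match = FAILED_STATE.search(line)
        if match:
            summary[match.group("ds")]["reasons"].append(match.group("reason").strip())
            continue
        match = REMOVED.search(line)
        if match:
            summary[match.group("ds")]["removed"] = True
    return dict(summary)
```

Passing an open handle to a copy of agent.log returns one entry per collector UUID. If every entry lists certificate_unknown among its reasons and has removed set to True, the log matches the pattern described in this article.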

