All collectors enter the Error state at the same time with error code AGENT008
Applies to
- Cloud Insights (CI)
- Storage Workload Security
Issue
- Alert emails are sent from accounts@service.cloudinsights.netapp.com:
- Subject:
Critical Health Alert: Storage Workload Security Data Collector '<Collector Name>' is disconnected
Description: SVM Data Collector '<Collector Name>' is disconnected. The SVM is not monitored and protected.
Error: Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008)
- Subject:
Warning Health Alert: Storage Workload Security User Directory Collector '<Collector Name>' is disconnected
Description: User Directory Collector '<Collector Name>' is disconnected. Users' information is not updated.
Error: Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008)
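- All collectors listed on the following pages are in the Error state and display the message shown after the list:
  - Workload Security > Collectors > Data Collectors
  - Workload Security > Collectors > User Directory Collectors
Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008)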
- agent.log shows that the collector state cannot be retrieved because of certificate_unknown, after which the collector is removed from monitoring:
[ERROR] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - Failed to get state of <DATASOURCE_UUID>, reason: NotAfter: <TIMESTAMP>
..
[ERROR] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - Failed to get state of <DATASOURCE_UUID>, reason: Received fatal alert: certificate_unknown
..
[INFO] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - Removed collector: <DATASOURCE_UUID> from monitoring
..
[INFO] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceStateManagerActor] - All collector health status has been updated- stateMap: [Map(<DATASOURCE_UUID> -> error)], statusMap: [Map(<DATASOURCE_UUID> ->Failed to determine the health of the collector within 2 retries, try restarting the collector again(Error Code: AGENT008))]
..
[WARN] [prod] [<TENANT_ID>] [<AGENT_UUID>] [agent-AgentDataSourceJvm] - Skipped Refresh Jwt as the datasource <DATASOURCE_UUID> is not running
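
To check whether a given agent.log shows the same sequence, the messages above can be counted per collector. The following is a minimal sketch, not a NetApp tool: the regular expressions are derived only from the excerpt above, the category names and the function name summarize_agent_log are hypothetical, and the log file path is supplied by the caller because this article does not state its location.

```python
import re
import sys
from collections import Counter

# Patterns taken from the agent.log excerpt above.
# The dictionary keys are illustrative labels, not product terminology.
PATTERNS = {
    "cert_not_after": re.compile(
        r"Failed to get state of ([^\s,]+), reason: NotAfter:"),
    "cert_unknown": re.compile(
        r"Failed to get state of ([^\s,]+), reason: "
        r"Received fatal alert: certificate_unknown"),
    "removed_from_monitoring": re.compile(
        r"Removed collector: ([^\s,]+) from monitoring"),
    "jwt_refresh_skipped": re.compile(
        r"Skipped Refresh Jwt as the datasource ([^\s,]+) is not running"),
}


def summarize_agent_log(lines):
    """Return {collector_uuid: Counter(label -> occurrences)}."""
    summary = {}
    for line in lines:
        for label, pattern in PATTERNS.items():
            match = pattern.search(line)
            if match:
                uuid = match.group(1)
                summary.setdefault(uuid, Counter())[label] += 1
    return summary


if __name__ == "__main__":
    # Usage: python summarize_agent_log.py <path to agent.log>
    if len(sys.argv) != 2:
        sys.exit("usage: summarize_agent_log.py <path to agent.log>")
    with open(sys.argv[1], encoding="utf-8", errors="replace") as handle:
        result = summarize_agent_log(handle)
    for uuid, counts in sorted(result.items()):
        print(uuid, dict(counts))
```

A collector that has both certificate-related counts and a removed_from_monitoring count matches the symptom described in this article. Collectors with none of these entries are probably failing for a different reason.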