Acquisition fails for all clusters in AIQUM because the maximum connection limit is exhausted
Applies to
- Active IQ Unified Manager (AIQUM) 9.6 and later
- All operating system platforms
- ONTAP 9.x
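Issue
- Acquisition fails intermittently for all clusters added to AIQUM, and AIQUM raises Cluster Monitoring Failed and Cluster Not Reachable alerts
- Acquisition resumes on its own after some time, or when it is triggered manually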
- All prerequisites are met on the AIQUM host, such as antivirus (AV) exclusions and sufficient CPU, memory, and disk space
- The SSL certificates for AIQUM and the ONTAP clusters are valid
- AIQUM au.log:
ERROR [common-pool-2064] c.o.s.a.d.n.NetAppOCIEArchivePerformancePackage (NetAppOCIEArchivePerformancePackage.java:381) - Failed to get archive file names from zapi. java.net.SocketTimeoutException: connect timed out
at java.net.PlainSocketImpl.waitForConnect(Native Method) ~[?:?]
...
Wrapped by: com.onaro.sanscreen.acquisition.framework.datasource.DataSourceErrorException: Failed to connect to <cluster IP/Hostname>
at com.onaro.sanscreen.acquisition.datasource.netapp_ocie.transport.zapi.ZAPIConnection.createDefaultNaServer(ZAPIConnection.java:803) ~[au-datasource-netappfoundation.jar:9.13.0-2023.09.J299]
...
ERROR [common-pool-2064] c.o.s.a.f.d.BaseDataSource (DataSourceErrorException.java:246) - <cluster_IP/Hostname> [Error connecting] - Failed to connect to <cluster IP/Hostname> (connect timed out)
- AIQUM ocumserver.log:
ERROR [oncommand] [reconciliation-0] [c.n.d.c.ClusterStatusListener] Socket connection error for cluster: <cluster IP/Hostname>java.net.ConnectException: Connection timed out: connect
ERROR [oncommand] [reconciliation-0] [c.n.d.c.ClusterStatusListener] Cluster : <cluster IP/Hostname> is not reachable. Generating cluster not reachable event.
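
As a rough triage aid, the timeout signatures in the two log excerpts above can be counted per cluster address to confirm that every cluster is affected rather than a single one. The following is a minimal Python sketch and not a NetApp tool: the function name count_timeouts, the regular expressions, and the sample addresses are assumptions made for illustration, and the patterns are derived only from the excerpts shown in this article.

```python
import re
from collections import Counter

# Patterns derived from the au.log and ocumserver.log excerpts in this article.
# They are illustrative assumptions and may not cover every message variant.
AU_TIMEOUT = re.compile(
    r"Failed to connect to (?P<target>\S+) \(connect timed out\)"
)
OCUM_UNREACHABLE = re.compile(
    r"Cluster : (?P<target>\S+) is not reachable"
)


def count_timeouts(lines):
    """Count connection-timeout signatures per cluster IP or hostname."""
    counts = Counter()
    for line in lines:
        for pattern in (AU_TIMEOUT, OCUM_UNREACHABLE):
            match = pattern.search(line)
            if match:
                counts[match.group("target")] += 1
                break  # count each log line at most once
    return counts


if __name__ == "__main__":
    # Hypothetical sample lines; the addresses are placeholders.
    sample = [
        "ERROR [common-pool-2064] c.o.s.a.f.d.BaseDataSource "
        "(DataSourceErrorException.java:246) - 10.0.0.5 [Error connecting] - "
        "Failed to connect to 10.0.0.5 (connect timed out)",
        "ERROR [oncommand] [reconciliation-0] [c.n.d.c.ClusterStatusListener] "
        "Cluster : 10.0.0.6 is not reachable. "
        "Generating cluster not reachable event.",
    ]
    for target, hits in sorted(count_timeouts(sample).items()):
        print(target, hits)
```

To run it against real files, pass an open file object for au.log or ocumserver.log to count_timeouts; the location of those files depends on the AIQUM installation and is not assumed here.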