由于ONTAP数据收集器的内部错误、Data Infrastructure Insight性能轮询失败

最后更新
另存为PDF

Views:: 19

Visibility:: Public

Votes:: 0

Category:: data-infrastructure-insights

Specialty:: oci

Last Updated:

适用场景

Data Infrastructure Insight (DII)(原Cloud Insights)
ONTAP 9
ONTAP System Manager
NetApp ONTAP数据管理软件数据收集器(已启用高级指标)

问题描述

无法获取性能数据、因为性能轮询失败、并且数据收集器在DII 或OnCommand System Manager的登录页面中显示以下消息：

Unable to poll performance ... error = Performance Recent Status Internal error: com.onaro.sanscreen.acquisition.framework.datasource.DataSourceErrorException: General Error

或

Data ONTAP API fail: System busy: 7 requests on table "perf_object_get_instances" have been pending for 1678674 seconds. The last completed call took 0 seconds.

查看ONTAP集群中相关数据收集器的storageperformance 示例日志(位于acq folder storageperformance_datacollectorname > one of the timestamp folders log_sample.log 中的错误报告中)时，可能会出现以下错误消息：

示例：

2021-03-12 17:19:33,895 ERROR [com.onaro.sanscreen.acquisition.datasource.netapp_ontap.NetAppOntapPerformancePackage] datalake collect and report (Poll Count: 1207, Is Macro Poll: false) : [storageperformance] data-collector-name: 1 apis failed: [storageperformance] data-collector-name: perf-object-get-instances(Object : workload) failed: Trying to perform arithmetic between two counters with different cardinality. Counter "read_io_type" has 1 elements, but the other counter "read_io_type" has 10 elements. (1 times)

2021-03-12 17:21:54,206 ERROR [com.onaro.sanscreen.acquisition.datasource.netapp_ontap.builder.ZapiIterBase] Aborting all performance api calls due to: perf-object-instance-list-info-iter(Object : lif) failed: System busy: 7 requests on table "perf_object_instance_list_info" have been pending for 2922550 seconds. The last completed call took 0 seconds.

2022-03-19 01:13:22,377 ERROR [com.onaro.sanscreen.acquisition.datasource.netapp_ontap.NetAppOntapPerformancePackage] datalake collect and report (Poll Count: 10124, Is Macro Poll: false) : [storageperformance] data-collector-name: 15 apis failed: [storageperformance] data-collector-name: perf-object-get-instances(Object : workload) failed: RPC: Remote system error [from mgwd on node "node_name" (VSID: -1) to cm at 127.0.0.1] (1 times)

此外、如果尝试在命令行界面中对集群SVM运行statistics lif show 命令(通过集群中任一节点的节点管理LIF进行访问)、如下图所示、也可能会出现类似错误。

注意：尽管performance-object 两条错误消息之间的特定 ZAPI调用可能不同、但该错误消息应与从DII中性能示例日志中获得的突出显示部分相同：

cluster1::> statistics lif show -vserver cluster1 Error: command failed: System busy: 7 requests on table "perf_object_get_instances" have been pending for 1147109 seconds. The last completed call took 0 seconds.