由于ONTAP数据收集器的内部错误、Data Infrastructure Insight性能轮询失败
适用场景
- Data Infrastructure Insight (DII)(原Cloud Insights)
- ONTAP 9
- ONTAP System Manager
- NetApp ONTAP数据管理软件数据收集器(已启用高级指标)
问题描述
- 无法获取性能数据、因为性能轮询失败、并且数据收集器在DII 或OnCommand System Manager的登录页面中显示以下消息:
Unable to poll performance ... error = Performance Recent Status
Internal error:
com.onaro.sanscreen.acquisition.framework.datasource.DataSourceErrorException: General Error
或
Data ONTAP API fail: System busy: 7 requests on table "perf_object_get_instances" have been pending for 1678674 seconds. The last completed call took 0 seconds.
- 查看ONTAP集群中相关数据收集器的
storageperformance
示例日志(位于acq folder
storageperformance_datacollectorname
>one of the timestamp folders
log_sample.log
中的错误报告中)时, 可能会出现以下错误消息:
示例:
2021-03-12 17:19:33,895 ERROR [com.onaro.sanscreen.acquisition.datasource.netapp_ontap.NetAppOntapPerformancePackage] datalake collect and report (Poll Count: 1207, Is Macro Poll: false) : [storageperformance] data-collector-name: 1 apis failed: [storageperformance] data-collector-name: perf-object-get-instances(Object : workload) failed: Trying to perform arithmetic between two counters with different cardinality. Counter "read_io_type" has 1 elements, but the other counter "read_io_type" has 10 elements. (1 times)
2021-03-12 17:21:54,206 ERROR [com.onaro.sanscreen.acquisition.datasource.netapp_ontap.builder.ZapiIterBase] Aborting all performance api calls due to: perf-object-instance-list-info-iter(Object : lif) failed: System busy: 7 requests on table "perf_object_instance_list_info" have been pending for 2922550 seconds. The last completed call took 0 seconds.
2022-03-19 01:13:22,377 ERROR [com.onaro.sanscreen.acquisition.datasource.netapp_ontap.NetAppOntapPerformancePackage] datalake collect and report (Poll Count: 10124, Is Macro Poll: false) : [storageperformance] data-collector-name: 15 apis failed: [storageperformance] data-collector-name: perf-object-get-instances(Object : workload) failed: RPC: Remote system error [from mgwd on node "node_name" (VSID: -1) to cm at 127.0.0.1] (1 times)
- 此外、如果尝试 在命令行界面中对集群SVM运行
statistics lif show
命令(通过集群中任一节点的节点管理LIF进行访问)、如下图所示、也可能会出现类似错误。
注意:尽管performance-object
两条错误消息之间的特定 ZAPI调用可能不同、但该错误消息应与从DII中性能示例日志中获得的突出显示部分相同:
cluster1::> statistics lif show -vserver cluster1
Error: command failed: System busy: 7 requests on table "perf_object_get_instances" have been pending for 1147109 seconds. The last completed call took 0 seconds.