间歇性无法访问 AIQUM 服务器,有时采集失败
适用于
- Active IQ Unified Manager (AIQUM) 9.8+
- 所有 OS 版本
问题描述
- 有时无法访问 AIQUM 服务器
- 即使 GUI 可访问,集群的获取也会间歇性地失败
- 因此,AIQUM 显示的旧数据(例如:磁盘空间利用率)与当前 ONTAP 数据不匹配
acq.log对于不同的集群有多次出现"Data source is already performing a poll of netappstorageperformance. Therefore this poll is skipped."- 这种情况可能发生在升级之后
- MySQL
error.log频繁出现"server out of memory"记录 - 重新启动后问题会解决一段时间
- 您可能会看到以下错误:
oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=/,mems_allowed=0,global_oom,task_memcg=/system.slice/ocie.service,task=java,pid=3879248,uid=997 out of memory: killed process 3879248 (java) total=vm:8816112kb, anon=rss:4154524kb, file-rss:0kb, shmem-rss:0kb, UID:997 pgtables:9072kb oom_score_adj:0 ocie.service: A process of this unit has been killed by the OOM killer. ocie.service: Main process exited, code=killed, status=9/KILL Stopping Active IQ Manamgent Server service ocie
ERROR [oncommand] [org.springframework.jms.listener.DefaultMessageListenerContainer#2-1] [c.n.i.s.r.dao.AcquisitionInfoDao] Data Acquistion Save exception:
org.springframework.dao.DuplicateKeyException: PreparedStatementCallback; SQL [INSERT INTO opm.acquisition_info (time,success,clusterId,analysisDuration,collectionDuration,msgRcvTime,isHistorical)
VALUES (?,?,?,?,?,?,?)]; (conn=19056) Duplicate entry 'XXXXXXXXX-XXXXX' for key 'acquisition_info.PRIMARY'; nested exception is java.sql.SQLIntegrityConstraintViolationException: (conn=19056)
Duplicate entry 'XXXXXXXXX-XXXXX' for key 'acquisition_info.PRIMARY'