控制器接管完成自动—新集群配置中出现通信错误警报
适用场景
- AFF A20
- 初始集群配置/设置
问题描述
- 报告了意外接管事件。示例:
HA Group Notification from node_name (CONTROLLER TAKEOVER COMPLETE AUTOMATIC - Communication Error) ALERT
- 在配对节点中、会提及以下警报:
[node_name: statd: cf.takeover.disabled:alert]: HA mode, but takeover of partner is disabled due to reason : unsynchronized log.使用:
[node_name: ThreadHandlerun: cf.fsm.clam.reqPartnerShtdwn:alert]: CLAM requests graceful shutdown of the HA partner to initiate a takeover while NVLOG is out of sync. Cluster and HA connectivity is down....[node_name: cf_main: cf.fsm.takeover.on.reboot:info]: Failover monitor: One node initiated automatic takeover after detecting that its partner node is rebooting....[node_name: shutdown_thread0: ha.localNodeShutDown:notice]: Shutdown of the local node has been initiated with inhibit_takeover set to FALSE.- 关闭的节点无法启动至"
Waiting for giveback..."状态。 - 在"
::> system switch ethernet show" ONTAP命令行界面输出 或CSHM-switch-config.XML AutoSupport部分检测到意外的不受支持的集群网络交换机(ONTAP硬件系统的交换机文档)。示例:
Device Name switch_name (aa:bb:cc:dd:ee:ff) IP Address 192.168.0.1 Model to display OTHER Switch Network cluster-network Software Version switch_name_firmware... Reference Config File Version NA SNMP Version SNMPv2c ...
设备的序列号未知
...
- 检测到同一设备 已连接到此平台的集群/HA物理端口: e4a和e4b。