
CONTROLLER TAKEOVER COMPLETE AUTOMATIC - Communication Error alert in a new cluster configuration

Category:
aff-series
Specialty:
hw

Applies to

  • AFF A20
  • Initial cluster configuration/setup

Issue

  • An unexpected takeover event is reported. Example:

HA Group Notification from node_name (CONTROLLER TAKEOVER COMPLETE AUTOMATIC - Communication Error) ALERT

  • On the partner node, the following alerts are logged:
[node_name: statd: cf.takeover.disabled:alert]: HA mode, but takeover of partner is disabled due to reason : unsynchronized log.
Along with:
 
[node_name: ThreadHandlerun: cf.fsm.clam.reqPartnerShtdwn:alert]: CLAM requests graceful shutdown of the HA partner to initiate a takeover while NVLOG is out of sync. Cluster and HA connectivity is down.
...
[node_name: cf_main: cf.fsm.takeover.on.reboot:info]: Failover monitor: One node initiated automatic takeover after detecting that its partner node is rebooting.
...
[node_name: shutdown_thread0: ha.localNodeShutDown:notice]: Shutdown of the local node has been initiated with inhibit_takeover set to FALSE.
  • The node that was shut down cannot boot to the "Waiting for giveback..." state.
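  • An unexpected, unsupported cluster network switch (see the switch documentation for ONTAP hardware systems) is detected in the output of the ONTAP CLI command "::> system switch ethernet show" or in the CSHM-switch-config.XML AutoSupport section. Example: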
Device Name                      switch_name (aa:bb:cc:dd:ee:ff)
IP Address                       192.168.0.1
Model to display                 OTHER
Switch Network                   cluster-network
Software Version                 switch_name_firmware...
Reference Config File Version    NA
SNMP Version                     SNMPv2c
...
Serial Number of the Device      Unknown
...
  • The same device is detected as connected to this platform's cluster/HA physical ports: e4a and e4b.
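
The symptoms listed above can be screened for in saved log and CLI text. The following is a minimal Python sketch, not a NetApp tool: it assumes the EMS lines and the switch record have been captured as plain text in the same layout as the examples in this article. The function names are hypothetical, and the "unsupported switch" check (model OTHER with reference config file version NA) is only a heuristic inferred from the single example shown here, not a documented rule.

```python
import re

# EMS lines in the layout shown in this article:
# [node: process: event.name:severity]: message
EMS_LINE = re.compile(
    r"\[(?P<node>[^:\]]+):\s*(?P<process>[^:\]]+):\s*"
    r"(?P<event>[\w.]+):(?P<severity>\w+)\]:\s*(?P<message>.*)"
)

# Event names copied from the log excerpts in this article.
TAKEOVER_EVENTS = {
    "cf.takeover.disabled",
    "cf.fsm.clam.reqPartnerShtdwn",
    "cf.fsm.takeover.on.reboot",
    "ha.localNodeShutDown",
}


def find_takeover_events(log_lines):
    """Return (event, severity) pairs for the EMS events listed in this article."""
    found = []
    for line in log_lines:
        match = EMS_LINE.search(line)
        if match and match.group("event") in TAKEOVER_EVENTS:
            found.append((match.group("event"), match.group("severity")))
    return found


def parse_switch_record(text):
    """Split 'Field   Value' lines (separated by two or more spaces) into a dict."""
    record = {}
    for line in text.splitlines():
        parts = re.split(r"\s{2,}", line.strip(), maxsplit=1)
        if len(parts) == 2:  # lines such as "..." have no value and are skipped
            record[parts[0]] = parts[1]
    return record


def looks_unsupported(record):
    """Heuristic (assumption): model OTHER and no reference config file version."""
    return (
        record.get("Model to display") == "OTHER"
        and record.get("Reference Config File Version") == "NA"
    )


# Sample input copied from the examples in this article.
sample_log = [
    "[node_name: statd: cf.takeover.disabled:alert]: HA mode, but takeover "
    "of partner is disabled due to reason : unsynchronized log.",
    "[node_name: cf_main: cf.fsm.takeover.on.reboot:info]: Failover monitor: "
    "One node initiated automatic takeover after detecting that its partner "
    "node is rebooting.",
]

sample_switch = """\
Device Name        switch_name (aa:bb:cc:dd:ee:ff)
IP Address           192.168.0.1
Model to display     OTHER
Switch Network     cluster-network
Reference Config File Version     NA
SNMP Version         SNMPv2c
"""

print(find_takeover_events(sample_log))
print(looks_unsupported(parse_switch_record(sample_switch)))
```

A match from both checks only indicates that the captured text resembles the examples in this article; it does not confirm the cause of the takeover.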
 

