如何收集HA IC互连链路断开或RDMA关闭问题的日志
问题描述
HA IC用于以下用途:
- NVRAM镜像
- 交换检测点、启动状态信息和故障转移状态
- 支持HA对中的控制器故障转移(CFO或根聚合)和存储故障转移(SFO或数据聚合)功能
内部系统上的HA IC链路通过InfiniBand连接或MCC IP上的软件iWARP实现。
在CVO AWS和GCP上—HA IC链路基于软件MVIA、而在CVO Azure和ONTAP Select HA IC基于软件iWARP
HA IC或RDMA关闭的影响:
- 已禁用接管
- 未同步的NVRAM日志
Cluster::*> system ha interconnect status show Node: Cluster-01 Link Status: up IC RDMA Connection:down Node: Cluster-02 Link Status: up IC RDMA Connection: down 2 entries were displayed.
Cluster::*> storage failover show Takeover Node Partner Possible State Description -------------- -------------- -------- ------------------------------------- Cluster-01 Cluster-02 false Waiting for Cluster-02, Takeover is not possible: NVRAM log not synchronized Cluster-02 Cluster-01 false Waiting for Cluster-01, Takeover is not possible: NVRAM log not synchronized
EMS.log: ONTAPSelect-A ALERT callhome.hainterconnect.down: Call home for HA INTERCONNECT DOWN due to peer not connected. ONTAPSelect-A ERROR ic.HAInterconnectDown: HA interconnect: Interconnect down for 839 minutes: peer not connected ONTAPSelect-A ALERT cf.takeover.disabled: HA mode, but takeover of partner is disabled due to reason : unsynchronized log.