跳转到主内容

由于集群端口上存在虚电路、节点重新启动后的集群通信问题描述

Views:
26
Visibility:
Public
Votes:
0
Category:
fas-systems<a>2009798765</a>
Specialty:
hw
Last Updated:

适用场景

  • AFF A700
  • 无交换机集群

问题描述

  • 在窗口维护期间、节点会手动重新启动
  • 节点重新启动后、出现集群通信问题、从而导致以下输出
  • 配对节点的聚合显示为未知:
cluster::> aggr show

Info: Node cluster-NodeB that hosts aggregate cluster_DATA_AGGR is
offline
Node cluster-NodeB that hosts aggregate cluster_ROOT is
offline


Aggregate Size Available Used% State #Vols Nodes RAID Status
--------- -------- --------- ----- ------- ------ ---------------- ------------
clusterA_DATA_AGGR
44.24TB 11.31TB 74% online 33 cluster- raid_dp,
NodeA normal
clusterA_ROOT
992.7GB 48.09GB 95% online 1 cluster- raid_dp,
NodeA normal
clusterB_DATA_AGGR
- - - unknown - cluster- -
NodeB
clusterB_ROOT
- - - unknown - cluster- -
NodeB
4 entries were displayed.

  • 集群端口已启动且似乎运行正常

cluster::> port show

Node: cluster-NodeA
Speed(Mbps) Health
Port IPspace Broadcast Domain Link MTU Admin/Oper Status
--------- ------------ ---------------- ---- ---- ----------- --------
e0M Default - up 1500 auto/1000 -
e0a Cluster - up 9000 auto/40000 -
e0e Cluster - up 9000 auto/40000 -
e0f Default - up 9000 auto/10000 -
e0g Default - up 9000 auto/10000 -
e0h Default - up 9000 auto/10000 -
e0i Default - up 9000 auto/10000 -
e5a Default - up 9000 auto/40000 -
e5e Default - down 9000 auto/auto -
9 entries were displayed.

  • 对于配对节点、集群环会显示脱机

cluster::> set diagnostic

Warning: These diagnostic commands are for use by NetApp personnel only.
Do you want to continue? {y|n}: y

cluster::*> cluster ring show
Node UnitName Epoch DB Epoch DB Trnxs Master Online
--------- -------- -------- -------- -------- --------- ---------
cluster-NodeA
mgmt 0 38 137123 - offline
cluster-NodeA
vldb 0 38 138432 - offline
cluster-NodeA
vifmgr 0 38 794750 - offline
cluster-NodeA
bcomd 0 38 80 - offline
cluster-NodeA
crs 0 38 1 - offline
cluster-NodeB
mgmt 40 40 446 cluster-NodeB
master
cluster-NodeB
vldb 40 40 187 cluster-NodeB
master
cluster-NodeB
vifmgr 40 40 60 cluster-NodeB
master
cluster-NodeB
bcomd 40 40 7 cluster-NodeB
master

Node UnitName Epoch DB Epoch DB Trnxs Master Online
--------- -------- -------- -------- -------- --------- ---------
cluster-NodeB
crs 40 40 1 cluster-NodeB
master
10 entries were displayed.

  • 此节点将报告集群运行状况为false

cluster::> cluster show
Node Health Eligibility
--------------------- ------- ------------
cluster-NodeA false true
cluster-NodeB true true

Warning: Cluster HA is not working correctly. Make sure that both nodes are healthy by using the "cluster show" command; then reconfigure
cluster HA to correct the configuration. Check the output of "cluster ha show" following the reconfiguration to verify node
health. If reconfiguring cluster HA does not resolve the issue, contact technical support for assistance.
2 entries were displayed.

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.