通过BES-53248交换机将A250节点添加到现有集群后、出现混乱状态
适用场景
- 从NetApp购买的Broadcom BES-53248交换机
- 集群扩展添加使用共享集群/HA端口的节点:
- AFF A320
- AFF A250
- FAS500f
问题描述
- 通过BES-53248交换机将AFF A250节点添加到现有集群时、使用缆线将新节点连接到集群交换机会导致集群端口关闭并引发崩溃:
May 08 00:10:01 [cluster-01:vifmgr.clus.linkdown:EMERGENCY]: The cluster port e0a on node cluster-01 has gone down unexpectedly.
May 08 00:11:13 [cluster-01:vifmgr.clus.linkdown:EMERGENCY]: The cluster port e0b on node cluster-01 has gone down unexpectedly.
PANIC : Received PANIC packet from partner, receiving message is (Coredump and takeover initiated because Connectivity, Liveliness and Availability Monitor (CLAM) has determined this node is out of quorum.)
- 连接到BES-53248交换机上端口0/1和0/2的现有节点集群端口、速度为10 G
- AFF A250节点以25G速度插入交换机端口0/3和0/4
- 现有节点将显示集群端口脱机、并等待集群应用程序联机:
Takeover
Node Partner Possible State Description
-------------- -------------- -------- -------------------------------------
cluster1-01 cluster1-02 - Waiting for cluster applications to
come online on the local node
Offline applications: mgmt, vldb,
vifmgr, bcomd, crs, scsi blade, clam.
cluster1-02 cluster1-01 true Connected to cluster1-01, Partial
giveback
cluster1::> net int show
(network interface show)
Logical Status Network Current Current Is
Vserver Interface Admin/Oper Address/Mask Node Port Home
----------- ---------- ---------- ------------------ ------------- ------- ----
Cluster
cluster1-01_clus1
up/- 169.254.11.190/16 cluster1-01 e0c false
cluster1-01_clus2
up/- 169.254.122.210/16 cluster1-01 e0c true
cluster1-02_clus1
up/down 169.254.10.90/16 cluster1-02 e0c false
cluster1-02_clus2
up/down 169.254.40.40/16 cluster1-02 e0c true
- 交换机日志显示、一旦连接新节点、现有集群端口就会关闭:
<189> May 7 16:05:42 switch1 TRAPMGR[trapTask]: traputil.c(721) 15624 %% NOTE Link Down: 0/1
<189> May 7 16:05:42 switch1 TRAPMGR[trapTask]: traputil.c(721) 15623 %% NOTE Link Down: 0/2
<189> May 7 16:05:42 switch1 TRAPMGR[trapTask]: traputil.c(721) 15620 %% NOTE SFP inserted in 0/3