由于BES-53248布线不正确、集群加入可能会失败
适用场景
- 集群 网络交换机(CNS) Broadcom BES-53248
- 集群端口 以不同速度运行的交换集群:10G和40G
问题描述
- 将新节点从其他平台迁移到现有集群时出错。示例:
Cluster network RPC communication test from local address 169.254.99.130 to 169.254.247.154
Error: Cluster network RPC communication test from local address 169.254.99.130 to 169.254.247.154 failed with subsequent larger RPC request of size 1024 where size 0 succeeded. Possible MTU mismatch on cluster network ports or network switch.
Reason: f_echo_1: RPC: Timed out; netid=tcp fd=253 TO=5.0s TT=5.001s O=1076b I=0b CN=275/2 VSID=-3 169.254.99.130:40455 <-> 169.254.247.154:7815. Verify the network configuration.- ONTAP事件消息(Event Messages,EMS)报告A
ClusterSwitchConnectivity_Alert。示例:
[node_name: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: ClusterSwitchConnectivity_Alert[node_name].
[node_name: cshmd: hm.alert.raised:alert]: Alert Id = ClusterSwitchConnectivity_Alert , Alerting Resource = node_name raised by monitor ethernet-switch
- 从CNS日志中、我们可以看到相同端口组中的某些链路停机:
switch_name TRAPMGR[trapTask]: traputil.c(753) 34807 %% NOTE Link Down: 0/5, Reason Code: 0x62 <189>
switch_name TRAPMGR[trapTask]: traputil.c(753) 34806 %% NOTE Link Down: 0/6, Reason Code: 0x62 <189>
switch_name TRAPMGR[trapTask]: traputil.c(753) 34799 %% NOTE SFP inserted in 0/7