40G/100G CX5 NIC 连接到 Broadcom BES-53248 集群交换机时出现 CRC 错误

最后更新
另存为PDF

Views:: 31

Visibility:: Public

Votes:: 0

Category:: fas-systems<a>2008899241</a>

Specialty:: hw

Last Updated:

适用场景

FAS/AFF
BES — 53248
集群互连

问题描述

入站数据包错误的系统警报。

示例：

cluster1::*> alert show (system health alert show) Node: cluster1-01 Resource: <switch port> Severity: Major Indication Time: Tue Feb 08 18:01:43 2022 Suppress: false Acknowledge: false Probable Cause: The percentage of inbound packet errors of switch interface "switchname/<switch port>" is above the warning threshold.

CLUSTER NETWORK DEGRADED 由于 CRC Errors Detected 连接到 BES-53248 时所有集群端口上都有。

示例：

<LR d="02Sep2021 13:17:24" n="node1" t="0000000" id="0/26207890441008" p="1" s="Ok" o="vifmgr" vf="" type="1" seq="180177" supp="98">
 <callhome_clus_net_degraded_1
         subject="CLUSTER NETWORK DEGRADED"
         event_type="CRC Errors Detected"
         event_details="High CRC errors detected on port e0c node node1"/>
 </LR>

从集群 LIF 执行 ping 操作时丢失数据包。

示例：

[?] Tue Sep 07 07:43:56 +0800 [node1: vifmgr: vifmgr.cluscheck.ctdpktloss:alert]: Continued packet loss when pinging from cluster lif clusterlif (node node1) to cluster lif clusterlif(node node2)

已多次针对以太网交换机发出集群警报 / 引发碎片。

示例：

[?] Tue Sep 07 02:31:14 +0800 [na02: cshmd: hm.alert.raised:alert]: Alert Id = ClusterIfInErrorsWarn_Alert , Alerting Resource = CS1/Slot: 0 Port: 50 100G - Level raised by monitor ethernet-switch

[?] Tue Sep 07 02:31:14 +0800 [na02: cshmd: hm.alert.cleared:notice]: Alert Id = ClusterIfInErrorsWarn_Alert , Alerting Resource = CS2/Slot: 0 Port: 50 100G - Level cleared by monitor ethernet-switch