40G/100G CX5 NIC 连接到 Broadcom BES-53248 集群交换机时出现 CRC 错误
适用场景
- FAS/AFF
- BES — 53248
- 集群互连
问题描述
- 入站数据包错误的系统警报。
示例:
cluster1::*> alert show
(system health alert show)
Node: cluster1-01
Resource: <switch port>
Severity: Major
Indication Time: Tue Feb 08 18:01:43 2022
Suppress: false
Acknowledge: false
Probable Cause: The percentage of inbound packet errors of switch
interface "switchname/<switch port>" is above the warning threshold.
CLUSTER NETWORK DEGRADED
由于CRC Errors Detected
连接到 BES-53248 时所有集群端口上都有。
示例:
<LR d="02Sep2021 13:17:24" n="node1" t="0000000" id="0/26207890441008" p="1" s="Ok" o="vifmgr" vf="" type="1" seq="180177" supp="98">
<callhome_clus_net_degraded_1
subject="CLUSTER NETWORK DEGRADED"
event_type="CRC Errors Detected"
event_details="High CRC errors detected on port e0c node node1"/>
</LR>
- 从集群 LIF 执行 ping 操作时丢失数据包。
示例:
[?] Tue Sep 07 07:43:56 +0800 [node1: vifmgr: vifmgr.cluscheck.ctdpktloss:alert]: Continued packet loss when pinging from cluster lif clusterlif (node node1) to cluster lif clusterlif(node node2)
- 已多次针对以太网交换机发出集群警报 / 引发碎片。
示例:
[?] Tue Sep 07 02:31:14 +0800 [na02: cshmd: hm.alert.raised:alert]: Alert Id = ClusterIfInErrorsWarn_Alert , Alerting Resource = CS1/Slot: 0 Port: 50 100G - Level raised by monitor ethernet-switch
[?] Tue Sep 07 02:31:14 +0800 [na02: cshmd: hm.alert.cleared:notice]: Alert Id = ClusterIfInErrorsWarn_Alert , Alerting Resource = CS2/Slot: 0 Port: 50 100G - Level cleared by monitor ethernet-switch