CRC Errors Detected - High CRC errors detected on cluster port
Applies to
- ONTAP 9
- FAS/AFF systems
- Cluster switches
Issue
- A cluster network port reports a high number of CRC errors per hour.
Example:
[Node-01: vifmgr: vifmgr.cluscheck.hwerrors:alert]: Port e0a on node Node-01 is reporting a high number (at least 1 per 1000 packets) of observed hardware errors (CRC, length, alignment, dropped).
[Node-01: vifmgr: callhome.clus.net.degraded:alert]: Call home for CLUSTER NETWORK DEGRADED: CRC Errors Detected - High CRC errors detected on port e0a node Node-01
- Multiple system health alerts point to the source of the issue.
Example:
NodeIfInErrorsWarnAlert
The percentage of inbound packet errors of node "node_name-01" on interface "e0a" is above the warning threshold.
ClusterIfInErrorsWarn_Alert
The percentage of inbound packet errors of switch interface "switch_name-01/Slot: 0 Port: 2 25G - Level" is above the warning threshold.
The percentage of outbound packet errors of switch interface "switch_name-01/Slot: 0 Port: 1 25G - Level" is above the warning threshold.
- Cluster LIFs on all nodes report total packet loss events when pinging a single cluster LIF:
[Node-01: vifmgr: vifmgr.cluscheck.droppedall:alert]: Total packet loss when pinging from cluster lif Node-01_clus2 (node Node-01) to cluster lif Node-02_clus2 (node Node-02).
[Node-01: vifmgr: vifmgr.cluscheck.droppedall:alert]: Total packet loss when pinging from cluster lif Node-02_clus1 (node Node-01) to cluster lif Node-02_clus2 (node Node-02).
- Errors are received on one switch port and transmitted out of another port:
BES-53248 cluster switch:
(switch_name-01)#show interface 0/1
Interface :.................................. 0/1
Packets Received Without Error................. 24203865
Packets Received With Error.................... 0
Broadcast Packets Received..................... 4279
Receive Packets Discarded...................... 375
Packets Transmitted Without Errors............. 24677379
Transmit Packets Discarded..................... 897376
Transmit Packet Errors......................... 897376
...
(switch_name-01)#show interface 0/2
Interface :.................................. 0/2
Packets Received Without Error................. 15630167
Packets Received With Error.................... 1019693
Broadcast Packets Received..................... 2265
Receive Packets Discarded...................... 13565
Packets Transmitted Without Errors............. 15644072
Transmit Packets Discarded..................... 1156
Transmit Packet Errors......................... 0
...
Cisco cluster switch:
(switch_name-01)#show interface Ethernet1/1/2
Ethernet1/1/2 is up
admin state is up, Dedicated Interface
Hardware: 10000 Ethernet, address: dc0b.0919.9299 (bia dc0b.0919.9299)
MTU 9216 bytes, BW 10000000 Kbit , DLY 10 usec
RX
5319553944 unicast packets 1070121 multicast packets 1179662 broadcast packets
5579105056 input packets 15897572746528 bytes
1855198116 jumbo packets 0 storm suppression bytes
0 runts 518955 giants 256782374 CRC 0 no buffer
257301329 input error 0 short frame 0 overrun 0 underrun 0 ignored
0 watchdog 0 bad etype drop 0 bad proto drop 0 if down drop
0 input with dribble 0 input discard
0 Rx pause
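To make the switch counters above easier to compare against the "at least 1 per 1000 packets" threshold quoted in the vifmgr.cluscheck.hwerrors message, the following is a minimal Python sketch that extracts the receive counters from pasted `show interface` text and computes an error ratio. The function names and the threshold constant are hypothetical helpers, not part of ONTAP or the switch software. Using "input packets" as the denominator for the Cisco output is an assumption made for illustration only, and applying the ONTAP threshold to switch counters is likewise only a rough comparison.

```python
import re

# Threshold taken from the EMS message text: at least 1 error per 1000 packets.
ERROR_RATIO_THRESHOLD = 1 / 1000


def bes_rx_error_ratio(show_interface_output: str) -> float:
    """Ratio of errored to total received packets in BES-53248 'show interface' text."""

    def counter(label: str) -> int:
        # Counters look like: "Packets Received With Error.................... 0"
        match = re.search(re.escape(label) + r"\.*\s*(\d+)", show_interface_output)
        if match is None:
            raise ValueError(f"counter not found: {label}")
        return int(match.group(1))

    good = counter("Packets Received Without Error")
    bad = counter("Packets Received With Error")
    total = good + bad
    return bad / total if total else 0.0


def cisco_crc_ratio(show_interface_output: str) -> float:
    """Ratio of CRC errors to input packets in Cisco 'show interface' text.

    Assumption: 'input packets' is used as the denominator; this is an
    approximation for triage, not a documented formula.
    """
    crc = re.search(r"(\d+)\s+CRC\b", show_interface_output)
    packets = re.search(r"(\d+)\s+input packets", show_interface_output)
    if crc is None or packets is None:
        raise ValueError("CRC or input packet counter not found")
    total = int(packets.group(1))
    return int(crc.group(1)) / total if total else 0.0


def exceeds_threshold(ratio: float) -> bool:
    """True when the ratio is at or above 1 error per 1000 packets."""
    return ratio >= ERROR_RATIO_THRESHOLD
```

Applied to the outputs above, port 0/2 on the BES-53248 switch and the Cisco port both come out well above the threshold, while port 0/1 shows no receive errors and only transmit errors. That pattern matches errors arriving on one port and being forwarded out of another.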
- The system health subsystem show command may report motherboard and/or switch health as "degraded" and trigger NodeIfInErrorsWarnAlert.
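When many vifmgr.cluscheck.droppedall events are logged, it can help to tally which destination LIF they have in common. The following is a small Python sketch, assuming the events have been saved as text lines in the format quoted in this article. The regular expression and the `count_destinations` helper are illustrative and are not an ONTAP tool.

```python
import re
from collections import Counter

# Matches the droppedall message format quoted in this article.
DROPPED_ALL = re.compile(
    r"vifmgr\.cluscheck\.droppedall:alert\]: Total packet loss when pinging "
    r"from cluster lif (?P<src>\S+) \(node (?P<src_node>[^)\s]+)\) "
    r"to cluster lif (?P<dst>\S+) \(node (?P<dst_node>[^)\s]+)\)"
)


def count_destinations(ems_lines):
    """Count droppedall events per (destination node, destination LIF)."""
    counts = Counter()
    for line in ems_lines:
        match = DROPPED_ALL.search(line)
        if match:
            counts[(match.group("dst_node"), match.group("dst"))] += 1
    return counts
```

A destination LIF that dominates the tally suggests that the port hosting that LIF, or the switch port and cable it connects to, is a reasonable place to start looking.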