由于集群端口上的CRC错误较高、集群网络已降级
适用场景
- ONTAP 9
- FAS/AFA系统
- CN1610集群交换机
- BES-53248集群交换机
问题描述
- 由于CRC错误、集群网络已降级、并且事件日志中会显示以下错误:
[Node-01: intr: netif.linkErrors:error]: Excessive link errors on network interface e0b. Might indicate a bad cable, switch port, or NIC, or that a cable connector is not fully inserted in a socket. On a 10/100 port, might indicate a duplex mismatch.
[Node-01: vifmgr: vifmgr.cluscheck.hwerrors:alert]: Port e0a on node Node-01 is reporting a high number (at least 1 per 1000 packets) of observed hardware errors (CRC, length, alignment, dropped).
[Node-01: vifmgr: callhome.clus.net.degraded:alert]: Call home for CLUSTER NETWORK DEGRADED: CRC Errors Detected - High CRC errors detected on port e0a node Node-01
- 如果在集群端口上发现链路翻盖、则事件日志中会显示以下警报:
[Node-01: vifmgr: vifmgr.port.monitor.failed:error]: The "link_flapping" health check for port e0a (node Node-01) has failed. The port is operating in a degraded state.
[Node-01: vifmgr: callhome.clus.net.degraded:alert]: Call home for CLUSTER NETWORK DEGRADED: Frequent Link Flapping - Cluster port e0a on node Node-01 has experienced multiple link down notifications.
- 在所有节点的集群端口上观察到较高的CRC错误:
::> system node run -node <node-name> -command ifstat <port-name>
-- interface e0a (4 days, 14 hours, 42 minutes, 47 seconds) --
RECEIVE
Total frames: 86771k | Frames/second: 218 | Total bytes: 289g
Bytes/second: 727k | Total errors: 65389 | Errors/minute: 10
Total discards: 0 | Discards/minute: 0 | Multi/broadcast: 121k
Non-primary u/c: 0 | CRC errors: 22207 | Runt frames: 0
Fragment: 0 | Long frames: 0 | Jabber: 41971
Length errors: 1211 | No buffer: 0 | Xon: 0
Xoff: 0 | Pause: 0 | Jumbo: 31475k
Noproto: 0 | Error symbol: 243k | Illegal symbol: 217k
Bus overruns: 0 | Queue drops: 0 | LRO segments: 62544k
- 在交换机端也观察到大量Rx和Tx错误和端口挡板:
#show interface counters
Port InOctets InUcastPkts InMcastPkts InBcastPkts InDropPkts Rx Error
--------- ---------------- ---------------- ---------------- ---------------- ---------------- ----------------
0/1 63884683472614 34223820975 116925 80962 5 35838
0/2 265584648397991 43844458781 116922 81071 1 1961079
Port OutOctets OutUcastPkts OutMcastPkts OutBcastPkts OutDropPkts Tx Error
--------- ---------------- ---------------- ---------------- ---------------- ---------------- ----------------
0/1 265607061634499 43843431844 1638223 565759 1952351 1952351
0/2 63884090686727 34225894361 1638180 565624 35018 35015
- 更换节点端的SFP不会停止这些错误。
- 根据
network device-discovery show
输出、报告存储错误的所有节点/端口均可连接到同一交换机。