由于集群端口上的CRC错误较高、集群网络已降级
适用场景
- ONTAP 9
- FAS/AFF系统
- NetApp集群交换机
问题描述
- 由于CRC错误、集群网络降级、并且事件日志中出现以下错误:
[Node-01: intr: netif.linkErrors:error]: Excessive link errors on network interface e0b. Might indicate a bad cable, switch port, or NIC, or that a cable connector is not fully inserted in a socket. On a 10/100 port, might indicate a duplex mismatch.
[Node-01: vifmgr: vifmgr.cluscheck.hwerrors:alert]: Port e0a on node Node-01 is reporting a high number (at least 1 per 1000 packets) of observed hardware errors (CRC, length, alignment, dropped).
[Node-01: vifmgr: callhome.clus.net.degraded:alert]: Call home for CLUSTER NETWORK DEGRADED: CRC Errors Detected - High CRC errors detected on port e0a node Node-01
- 在所有节点的集群端口上观察到较高的CRC错误:
::> system node run -node <node-name> -command ifstat <port-name>
-- interface e0a (4 days, 14 hours, 42 minutes, 47 seconds) --
RECEIVE
Total frames: 86771k | Frames/second: 218 | Total bytes: 289g
Bytes/second: 727k | Total errors: 65389 | Errors/minute: 10
Total discards: 0 | Discards/minute: 0 | Multi/broadcast: 121k
Non-primary u/c: 0 | CRC errors: 22207 | Runt frames: 0
Fragment: 0 | Long frames: 0 | Jabber: 41971
Length errors: 1211 | No buffer: 0 | Xon: 0
Xoff: 0 | Pause: 0 | Jumbo: 31475k
Noproto: 0 | Error symbol: 243k | Illegal symbol: 217k
Bus overruns: 0 | Queue drops: 0 | LRO segments: 62544k
- 此外、还会在交换机侧观察到CRC错误和端口盖。
- 更换节点端的SFP不会停止这些错误。