由于集群端口/SFP连接错误、报告集群网络已降级警报
适用场景
- ONTAP 9
- 集群网络
问题描述
- 多次收到以下集群网络降级AutoSupport 通知:
HA Group Notification (CLUSTER NETWORK DEGRADED) ALERT
- 除IFSTAT输出的"接收"部分所示的一个集群端口外、所有节点的集群端口上都报告了多个错误。
system health alert show
您可能会在命令输出中看到集群交换机运行状况已降级- 存在多个错误的集群端口(可能会同时显示CRC错误、错误符号、非法符号等):
clustershell::>system node run -node <nodename> -command ifstat -a
-- interface e0b (219 days, 5 hours, 16 minutes, 2 seconds) --
RECEIVE
Frames/second: 2301 | Bytes/second: 3785k | Errors/minute: 0
Discards/minute: 0 | Total frames: 154g | Total bytes: 176t
Total errors: 43918| Total discards: 65 | Multi/broadcast: 4452k
No buffers: 0 | Non-primary u/c: 0 | L2 terminate: 14908
Tag drop: 0 | Vlan tag drop: 0 | Vlan untag drop: 0
Vlan forwards: 0 |CRC errors: 29328| Runt frames: 0
Fragment: 0 | Long frames: 65 | Jabber: 0
Error symbol: 29328 | Illegal symbol: 14590 | Bus overruns: 0
Queue drop: 0 | Xon: 0 | Xoff: 0
Jumbo: 5634k | JMBuf RxFrames: 162g | JMBuf DrvCopy: 27146
- 单个集群端口无错误:
clustershell::>system node run -node <nodename> -command ifstat -a
-- interface e0b (219 days, 7 hours, 2 minutes, 24 seconds) --
RECEIVE
Frames/second: 1092 | Bytes/second: 950k | Errors/minute: 0
Discards/minute: 0 | Total frames: 47631m | Total bytes: 107t
Total errors: 0| Total discards: 1159 | Multi/broadcast: 4473k
No buffers: 1087 | Non-primary u/c: 0 | L2 terminate: 302
Tag drop: 0 | Vlan tag drop: 0 | Vlan untag drop: 0
Vlan forwards: 0 |CRC errors: 0| Runt frames: 0
Fragment: 0 | Long frames: 50 | Jabber: 0
Error symbol: 0 | Illegal symbol: 0 | Bus overruns: 22
Queue drop: 0 | Xon: 0 | Xoff: 0
Jumbo: 2769m | JMBuf RxFrames: 0 | JMBuf DrvCopy: 0
- 您可能会在EMS日志和生成的运行状况警报中看到以下消息
netif.linkErrors: Excessive link errors on network interface e2c. Might indicate a bad cable, switch port, or NIC, or that a cable connector is not fully inserted in a socket. On a 10/100 port, might indicate a duplex mismatch.
- 可能会针对唯一节点端口的已连接交换机端口生成以下警报、但未收到CRC错误:
[?] Tue Nov 01 18:13:10 -0700 [node-01: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm:ClusterIfInErrorsWarn_Alert[switch01(FOC123456789)/Ethernet1/9].