
Cluster Network Degraded alert reported multiple times due to errors on cluster ports

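Applies to

  • ONTAP 9
  • Cluster switches

Issue

  • Multiple Cluster Network Degraded AutoSupport notifications like the following are received:

HA Group Notification (CLUSTER NETWORK DEGRADED) ALERT

  • The cluster ports on all nodes, with the exception of one cluster port, report multiple errors in the RECEIVE section of the ifstat output.
  • In the output of the system health alert show command, you may see that the cluster switch health is degraded.
  • Cluster port with multiple errors (CRC errors, error symbols, illegal symbols, and so on may occur together):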

clustershell::>system node run -node <nodename> -command ifstat -a

-- interface e0b (219 days, 5 hours, 16 minutes, 2 seconds) --
RECEIVE
 Frames/second:   2301  | Bytes/second:    3785k  | Errors/minute:      0
 Discards/minute:   0  | Total frames:    154g  | Total bytes:      176t
 Total errors:   43918 | Total discards:    65  | Multi/broadcast:   4452k
 No buffers:      0  | Non-primary u/c:    0  | L2 terminate:     14908
 Tag drop:         0  | Vlan tag drop:     0  | Vlan untag drop:     0
 Vlan forwards:    0  | CRC errors:      29328  | Runt frames:       0
 Fragment:         0  | Long frames:      65  | Jabber:         0
 Error symbol:    29328  | Illegal symbol:   14590  | Bus overruns:      0
 Queue drop:      0  | Xon:           0  | Xoff:          0
 Jumbo:      5634k  | JMBuf RxFrames:   162g  | JMBuf DrvCopy:    27146

  • The single cluster port with no errors:

clustershell::>system node run -node <nodename> -command ifstat -a

-- interface e0b (219 days, 7 hours, 2 minutes, 24 seconds) --
RECEIVE
 Frames/second:   1092  | Bytes/second:     950k  | Errors/minute:      0
 Discards/minute:    0  | Total frames:    47631m  | Total bytes:      107t
 Total errors:      0 | Total discards:   1159  | Multi/broadcast:   4473k
 No buffers:    1087  | Non-primary u/c:     0  | L2 terminate:       302
 Tag drop:         0  | Vlan tag drop:      0  | Vlan untag drop:      0
 Vlan forwards:    0  | CRC errors:       0 | Runt frames:         0
 Fragment:         0  | Long frames:      50  | Jabber:         0
 Error symbol:       0  | Illegal symbol:     0  | Bus overruns:       22
 Queue drop:      0  | Xon:           0  | Xoff:               0
 Jumbo:      2769m  | JMBuf RxFrames:     0  | JMBuf DrvCopy:      0
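
To compare the two RECEIVE sections above without reading every counter by eye, the output can be split into name/value pairs and checked for the counters this article highlights. The following is a minimal Python sketch, not a NetApp tool: the function names and the ERROR_COUNTERS watch list are hypothetical, and it assumes the "name: value | name: value" layout shown above. Values are kept as raw strings so that no meaning is assumed for suffixes such as k, m, g, or t.

```python
# Counters the article calls out as signs of a bad link (an assumed watch list).
ERROR_COUNTERS = ("CRC errors", "Error symbol", "Illegal symbol")


def parse_ifstat_counters(text):
    """Return {counter name: raw value string} from 'name: value | name: value' lines."""
    counters = {}
    for line in text.splitlines():
        if "|" not in line:
            continue  # skip the prompt, the interface banner, and 'RECEIVE'
        for field in line.split("|"):
            name, sep, value = field.partition(":")
            if sep:
                counters[name.strip()] = value.strip()
    return counters


def nonzero_error_counters(counters):
    """Return the watched counters whose raw value is anything other than '0'."""
    return {
        name: counters[name]
        for name in ERROR_COUNTERS
        if counters.get(name, "0") != "0"
    }


# Three lines copied from the port with errors shown above.
SAMPLE_WITH_ERRORS = """\
 Vlan forwards:    0  | CRC errors:      29328  | Runt frames:       0
 Fragment:         0  | Long frames:      65  | Jabber:         0
 Error symbol:    29328  | Illegal symbol:   14590  | Bus overruns:      0
"""

if __name__ == "__main__":
    print(nonzero_error_counters(parse_ifstat_counters(SAMPLE_WITH_ERRORS)))
```

Running the same check on the output of the port without errors returns an empty dictionary, which is one way to identify the single port that behaves differently from the rest.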

  • The single cluster port reports low optical power (mW) in the ifconfig -vvv output:

::> system node run -node <nodename> -command ifconfig -vvv

e0b: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> metric 0 mtu 9000
    uuid: 0320a80b-caa3-11eb-b14a-d039ea306760
    ...
   RX: 0.06 mW (-12.13 dBm) TX: 0.55 mW (-2.59 dBm)
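
The RX and TX readings above give the same optical power in two units, related by dBm = 10 × log10(P / 1 mW). The sketch below is a hedged Python illustration of that conversion and of pulling both readings out of the line; the function names and the regular expression are hypothetical and assume the exact "RX: … mW (… dBm) TX: … mW (… dBm)" layout shown above. The article does not state a threshold for "low", so none is encoded here.

```python
import math
import re

# Assumed layout: 'RX: 0.06 mW (-12.13 dBm) TX: 0.55 mW (-2.59 dBm)'
POWER_RE = re.compile(r"(RX|TX):\s*([\d.]+)\s*mW\s*\((-?[\d.]+)\s*dBm\)")


def mw_to_dbm(milliwatts):
    """Convert optical power from milliwatts to dBm."""
    return 10 * math.log10(milliwatts)


def dbm_to_mw(dbm):
    """Convert optical power from dBm to milliwatts."""
    return 10 ** (dbm / 10)


def parse_optical_power(line):
    """Return {'RX': (mW, dBm), 'TX': (mW, dBm)} for the directions found in the line."""
    return {
        direction: (float(mw), float(dbm))
        for direction, mw, dbm in POWER_RE.findall(line)
    }


if __name__ == "__main__":
    readings = parse_optical_power("   RX: 0.06 mW (-12.13 dBm) TX: 0.55 mW (-2.59 dBm)")
    for direction, (mw, dbm) in readings.items():
        # The mW figure is rounded to two decimals in the output, so the
        # dBm value is the more precise of the two.
        print(direction, mw, dbm, round(dbm_to_mw(dbm), 4))
```

Because the mW value is printed with only two decimals, converting 0.06 mW back gives roughly -12.22 dBm rather than the reported -12.13 dBm; comparing ports by their dBm values avoids that rounding.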

  • You may see the following message in the EMS log and in the generated health alerts:

netif.linkErrors: Excessive link errors on network interface e2c. Might indicate a bad cable, switch port, or NIC, or that a
cable connector is not fully inserted in a socket. On a 10/100 port, might indicate a duplex mismatch.

  • The following alert may be generated for the switch port connected to the one node port that does not receive CRC errors:

[?] Tue Nov 01 18:13:10 -0700 [node-01: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: ClusterIfInErrorsWarn_Alert[switch01(FOC123456789)/Ethernet1/9].
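
When many of these call home messages accumulate, it can help to extract the switch and port from each one so that the affected switch port can be matched to the node port connected to it. The following Python sketch is illustrative only: the function name and the pattern are hypothetical, derived from the single example line above. It is also an assumption that the value in parentheses is the switch serial number.

```python
import re

# Pattern inferred from the one example in this article:
#   <Name>_Alert[<switch>(<id>)/<port>]
ALERT_RE = re.compile(
    r"(?P<alert>\w+_Alert)"
    r"\[(?P<switch>[^(\]]+)"
    r"\((?P<switch_id>[^)]+)\)"  # assumed to be the switch serial number
    r"/(?P<port>[^\]]+)\]"
)


def parse_cshm_alert(line):
    """Return the alert name, switch, switch id, and port, or None if no match."""
    match = ALERT_RE.search(line)
    return match.groupdict() if match else None


if __name__ == "__main__":
    example = (
        "[?] Tue Nov 01 18:13:10 -0700 [node-01: mgwd: callhome.hm.alert.major:alert]: "
        "Call home for Health Monitor process cshm: "
        "ClusterIfInErrorsWarn_Alert[switch01(FOC123456789)/Ethernet1/9]."
    )
    print(parse_cshm_alert(example))
```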

 
