跳转到主内容

节点重新启动后、存储端口关闭

Views:
Visibility:
Public
Votes:
0
Category:
ontap-9
Specialty:
CORE<a>2010024495</a>
Last Updated:

适用场景

  • AFF A900
  • 安装了RCF1.11的Cisco Nexus 9336交换机
  • ONTAP 9

问题描述

  • 重新启动节点且存储端口 随机关闭后、EMS发出以下警报。

 Health Monitor process nchm:NoPathToNSMA_Alert…

  • sysconfig -a, System Storage Configuration更 Quad-Path HA 改为 Single-Path HA 、E2A将自动显示为未知FD-down。

Node: node-1
     NetApp Release 9.14.1P4: Thu Apr 18 11:59:10 EDT 2024
     System ID: XXXXXXXXXX (node-1); partner ID: XXXXXXXXXX (node-2)
     System Serial Number: XXXXXXXXXXXX (node-1)
 System Rev: C0
     System Storage Configuration: Single-Path HA
     System ACP Connectivity: Inband Active
 SAS2/SAS3 Mixed Stack Support: all
     All-Flash Optimized: true
     Capacity Optimized: false
     All SAN Array: false
     Backplane Part Number: 111-02392
     Backplane Rev: C0
     Backplane Serial Number: XXXXXXXXXXXX
     slot 0: System Board 2.2 GHz (System Board XXIX B4)
         Model Name:      AFF-A900
         Part Number: 111-04824
         Revision:       B4
         Serial Number:    XXXXXXXXXXXX
         BIOS version:   18.9
         Loader version:    8.1.0
         Boot Flash:      Primary
         Processors: 64



     slot 2: Dual 40G/100G/200G Ethernet Controller CX6
         e2a MAC Address:   XX:00:XX:11:XX:XX (auto-unknown-fd-down)
   QSFP Vendor:      Amphenol
           QSFP Part Number:   112-00574
           QSFP Serial Number: APFXXXXXXXXXXXX
         e2b MAC Address:   XX:22:XX:33:XX:XX (auto-unknown-fd-down)
           QSFP Vendor:
        QSFP Part Number:
           QSFP Serial Number:
         Device Type:     CX6 PSID(NAP0000000018)
         Firmware Version:   20.30.1004
         Part Number:     111-04739
         Hardware Revision:  A2

  • 重新启动后、EMS日志会显示存储端口E2A/e10b上的链路启动/关闭。

[?] Thu May 09 10:12:45 +0900 [node-1: kernel: netif.linkDown:info]: Ethernet e2a: Link down, check cable.
[?] Thu May 09 10:23:07 +0900 [node-1: kernel: netif.linkDown:info]: Ethernet e10b: Link down, check cable.


[?] Thu May 09 10:12:45 +0900 [node-2: kernel: netif.linkDown:info]: Ethernet e2a: Link down, check cable.
[?] Thu May 09 10:23:07 +0900 [node-2: kernel: netif.linkDown:info]: Ethernet e10b: Link down, check cable.

  • 交换机日志显示已连接端口出现错误"unsupported (不支持)"、但Hardware Universe确认此端口受支持。

[2024-05-09 19:15:36.280] 2024 May 9 01:27:10 switch02 %ETHPORT-5-IF_HARDWARE: Interface Ethernet1/11, hardware type changed to 100G
[2024-05-09 19:15:36.280] 2024 May 9 01:27:10 switch02 %ETHPORT-3-IF_UNSUPPORTED_TRANSCEIVER: Transceiver on interface Ethernet1/11 is not supported
[2024-05-09 19:15:36.311] 2024 May 9 01:27:11 switch02 %ETHPORT-5-IF_HARDWARE: Interface Ethernet1/12, hardware type changed to 100G
[2024-05-09 19:15:36.311] 2024 May 9 01:27:11 switch02 %ETHPORT-3-IF_UNSUPPORTED_TRANSCEIVER: Transceiver on interface Ethernet1/12 is not supported
[2024-05-09 19:21:17.984] 2024-05-09T09:02:43.200667000+00:00 [M 1] [ethpm] E_DEBUG Ifindex (Ethernet1/12)0x1a000000, SFP security check: unsupported vendor id 0x54
[2024-05-09 19:21:18.001] 2024-05-09T09:02:37.697065000+00:00 [M 1] [ethpm] E_DEBUG Ifindex (Ethernet1/11)0x1a000000, SFP security check: unsupported vendor id 0x54

 

  • storage port show 显示了一个存储端口在重新启动后随机关闭。

                    Speed            VLAN
 Node         Port Type  Mode   (Gb/s) State   Status    ID
 ------------------ ---- ----- ------- ------ -------- --------- ----
 node-1
           e10a ENET  network    - -     -       -
           e10b ENET  storage   100 enabled  online    30
           e11a ENET  network    - -     -       -
           e11b ENET  network    - -     -       -
           e2a  ENET  storage    0 enabled  offline   30
           e2b  ENET  network    - -     -       -
 node-2
           e10a ENET  network    - -     -       -
           e10b ENET  storage   100 enabled  online    30
           e11a ENET  network    - -     -       -
           e11b ENET  network    - -     -       -
           e2a  ENET  storage   100 enabled  online    30
           e2b  ENET  network    - -     -       -
12 entries were displayed.

  • 在ifstat中未发现错误/丢弃、但我们看到 Up to down 指示发生了8次轻拍。 

-- interface  e2a  (1 day, 20 hours, 12 minutes, 38 seconds) --

RECEIVE
 Total frames:    5938  | Frames/second:     0  | Total bytes:    1721k
 Bytes/second:     11  | Total errors:     0  | Errors/minute:     0
 Total discards:    0  | Discards/minute:    0  | Multi/broadcast:  4773
 Non-primary u/c:    0  | Errored frames:    0  | Unsupported Op:    0
 CRC errors:      0  | Runt frames:      0  | Fragment:       0
 Long frames:      0  | Jabber:        0  | Length errors:     0
 Alignment errors:   0  | No buffer:       0  | Pause:         0
 Jumbo:         0  | Error symbol:     0  | Bus overruns:     0
 Queue drops:      0  | LRO segments:     0  | LRO bytes:       0
 LRO6 segments:    887  | LRO6 bytes:     122k | Bad UDP cksum:     0
 Bad UDP6 cksum:    0  | Bad TCP cksum:     0  | Bad TCP6 cksum:    0
 Mcast v6 solicit: 3691  | Lagg errors:      0  | Lacp errors:      0
 Lacp PDU errors:    0
TRANSMIT
 Total frames:    2420  | Frames/second:     0  | Total bytes:     242k
 Bytes/second:     2  | Total errors:     0  | Errors/minute:     0
 Total discards:    0  | Queue overflow:    0  | Multi/broadcast:   950
 Collisions:      0  | Pause:         0  | Jumbo:         0
 Cfg Up to Downs:    2  | TSO segments:     0  | TSO bytes:       0
 TSO6 segments:     0  | TSO6 bytes:      0  | HW UDP cksums:     0
 HW UDP6 cksums:   105k | HW TCP cksums:     0  | HW TCP6 cksums:   1413
 Mcast v6 solicit:  106k | Lagg drops:      0  | Lagg no buffer:    0
 Lagg no entries:    0
DEVICE
 Mcast addresses:    7  | Rx MBuf Sz:     9216
LINK INFO
 Speed:        100G | Duplex:       full | Flowcontrol:     none
 Media state:    active | Up to downs:      8 | HW assist:      514k

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.