跳转到主内容

正在等待 ANDU 死机重启后清除预订

Views:
Visibility:
Public
Votes:
0
Category:
aff-series
Specialty:
HW
Last Updated:

适用于

问题

  • 在 ANDU ONTAP 升级期间,在第一个节点被接管、完成升级并重新启动后,HA 互连链路断开
  • 接管节点可能已死机或在接管后的某个时间点被重新启动
  • 升级后的节点将仅启动到"等待保留清除"状态,并且无法进行回馈(由于未处于"等待回馈"状态)
  • 由于缺乏工作的 HA 互连,ANDU 进程将停止并且无法恢复
  • HA 互连的两个物理链路都已启动并且集群端口正常工作
  • 只有 RDMA 链路断开:

    cluster::> system ha interconnect status show            
    Node: node1
           Link 0 Status: up
           Link 1 Status: up
          Is Link 0 Active: true      
          Is Link 1 Active: true     
          IC RDMA Connection: down

    Warning: Unable to list entries on node "node2". RPC: Couldn't make connection [from mgwd on node "node1" (VSID: -1) to kernel at 169.254.200.103]

    Error: show failed: RPC: Couldn't make connection [from mgwd on node "node1" (VSID: -1) to kernel at 169.254.200.103]  * waiting for reservations to clear is shown in node2's console


  • 在故障排除期间,重新启动断开的节点、重新启动正常的节点、重新插拔互连电缆以及重新启动集群交换机对 RDMA 链路状态没有影响

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.