跳转到主内容

由于集群管理LIF迁移失败、ANDU已暂停

Views:
46
Visibility:
Public
Votes:
0
Category:
ontap-9
Specialty:
core
Last Updated:

适用场景

  • ONTAP 9
  • 自动化无中断升级(ANDU)

问题描述

  • ANDU 已暂停、并出现以下事件:

[Node-01: upgrademgr: upgrademgr.update.pausedErr:debug]: The automated update of the cluster has been paused due to the following reason:  Node "Node-02": Error: {Failed to migrate data LIFs from node "Node-01".}, Action: {Migrate all of the data LIFs using the "network interface migrate-all -node Node-01" command.}.
[Node-01: notifyd: callhome.andu.pausederr:alert]: params: {'epoch': '68XXXXXd-5XX6-4XX6-a068-6XXXXXXXXXb5', 'subject': 'AUTOMATED NDU PAUSED ON NODE: Node-02'}

Sat Jan 18 10:24:44 +0530 [Node-01: vifmgr: vifmgr.lifs.noredundancy:alert]: No redundancy in the failover configuration for 2 LIFs assigned to node "Node-01". LIFs: xxxx:Node-01_mgmt1, XXXX:cluster_mgmt
Sat Jan 18 10:24:44 +0530 [Node-01: vifmgr: vifmgr.lif.subnetMisconfig:error]: LIFs in subnet 10.254.xx.xx/23 of IPspace "Default" are configured on ports in multiple broadcast domains:  Default, Default-3

  • 已正确定义数据生命周期的故障转移目标、并且这些目标已成功迁移。
  • VIFMGR 日志中、我们发现、节点02对于两个节点的节点管理LIF以及集群管理LIF具有未定义的状态:

00000004.000035f3 02a0312c Thu Sep 28 2023 11:16:40 -07:00 [kern_vifmgr:info:6907] [0x80ac8fa00] [anon-ns::table_to_vifmgr_log] 1013 8 - undef active 4294967295 10.20.225.72 255.255.255.128 - - - Node-02_mgmt1 local-only up 0 mgmt true false -  false 101 8 Default-1 - - - fc07ebda-11c9-11ee-ac23-d039eaa8067f - - - up - true - - - - - - 4294967295 - - - - 4 - 1687957225 true Node-02 -
00000004.000035f4 02a0312c Thu Sep 28 2023 11:16:40 -07:00 [kern_vifmgr:info:6907] [0x80ac8fa00] [anon-ns::table_to_vifmgr_log] 1022 1 - undef active 4294967295 10.20.225.71 255.255.255.128 - - - Node-01_mgmt1 local-only up 0 mgmt true false -  false 101 1 Default - - - 5c586a2d-11c9-11ee-bcea-d039eaa7fe69 - - - up - true - - - - - - 4294967295 - - - - 4 - 1687957352 true Node-01 -

00000004.000035f7 02a0312c Thu Sep 28 2023 11:16:40 -07:00 [kern_vifmgr:info:6907] [0x80ac8fa00] [anon-ns::table_to_vifmgr_log] 1025 1 - undef active 4294967295 10.20.225.70 255.255.255.128 - - - cluster_mgmt broadcast-domain-wide up 0 mgmt true false -  false 101 1 Default - - 5e998d12-11c9-11ee-bcea-d039eaa7fe69 a10dc3f5-11c9-11ee-bcea-d039eaa7fe69 - - - up - true - - - - - - 4294967295 - - - false 4 - - false Node1-01 -

  • 检查广播域后、发现节点-02的e0M端口不在预期的默认广播域中:

IPspace Name                Cluster       Default       Default       Default

Layer 2 Broadcast Domain    Cluster       Default       Default-1     SVM
Broadcast Domain ID            1             2             3           4
Configured MTU              9000          1500          1500          1500
Ports                       Node-01:e0a
                            Node-01:e0b
                            Node-02:e0a   Node-01:e0M   Node-02:e0M   Node-01:a0a
                            Node-02:e0b

 

  • 因此、在ANDU过程中、无法将集群管理LIF迁移到节点02、从而导致ANDU暂停。

 

  • 在另一种情况下、节点-02的e0M端口不在任何广播 域中、这是由于节点在最初设置集群后加入。

IPspace Name                Cluster       Default                Default
Layer 2 Broadcast Domain    Cluster       Default-1              SVM
Broadcast Domain ID            1             2                    3
Configured MTU              9000          1500                   1500
Ports                       Node-01:e0a
                            Node-01:e0b
                            Node-02:e0a   Node-01:e0M            Node-01:a0a
                            Node-02:e0b  <No entry for Node -02>

 

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.

 

  • 这篇文章对您有帮助吗?