MGWD crashes/restarts on all nodes after upgrading ONTAP from 9.8 or 9.9 to 9.10.1 or later

Category:
ontap-9
Specialty:
core

Applies to

  • ONTAP
  • Upgrades from certain releases of ONTAP 9.8 or 9.9 to 9.10.1 or later

Issue

  • After one or more nodes are upgraded to ONTAP 9.10.1, the mgmt cluster application restarts continually and flips between offline and online on all nodes:

clus1::*> cluster ring show
Node      UnitName Epoch    DB Epoch DB Trnxs Master    Online
--------- -------- -------- -------- -------- --------- ---------
clus1-01  mgmt     0        1208     45       -         offline
clus1-01  vldb     32       32       525821   clus1-01  master
clus1-01  vifmgr   104      104      889174   clus1-01  master
clus1-01  bcomd    32       32       2879     clus1-01  master
clus1-01  crs      32       32       745      clus1-01  master
clus1-02  mgmt     0        1208     45       -         offline
clus1-02  vldb     32       32       525821   clus1-01  secondary
clus1-02  vifmgr   104      104      889174   clus1-01  secondary
clus1-02  bcomd    32       32       2879     clus1-01  secondary
clus1-02  crs      32       32       745      clus1-01  secondary
clus1-03  mgmt     0        1208     45       -         offline
clus1-03  vldb     32       32       525821   clus1-01  secondary
clus1-03  vifmgr   104      104      889174   clus1-01  secondary
clus1-03  bcomd    32       32       2879     clus1-01  secondary
clus1-03  crs      32       32       745      clus1-01  secondary
clus1-04  mgmt     0        1208     45       -         offline
clus1-04  vldb     32       32       525821   clus1-01  secondary
clus1-04  vifmgr   104      104      889174   clus1-01  secondary
clus1-04  bcomd    32       32       2879     clus1-01  secondary
clus1-04  crs      32       32       745      clus1-01  secondary
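
When the ring output has been captured to a text file, a small script can make the symptom easier to spot across many nodes. The following is a minimal Python sketch, not a NetApp tool: the function name offline_units is hypothetical, and it assumes each data row has exactly seven whitespace-separated fields, as in the output above.

```python
from collections import defaultdict


def offline_units(ring_output):
    """Return {unit_name: [nodes]} for rows whose Online column is 'offline'.

    Assumes the seven-column layout shown in the article:
    Node, UnitName, Epoch, DB Epoch, DB Trnxs, Master, Online.
    """
    offline = defaultdict(list)
    for line in ring_output.splitlines():
        fields = line.split()
        # The header splits into nine tokens and the prompt into four,
        # so only data rows and the dashed separator have seven fields.
        if len(fields) != 7 or set(fields[0]) == {"-"}:
            continue
        node, unit, online = fields[0], fields[1], fields[6]
        if online == "offline":
            offline[unit].append(node)
    return dict(offline)


sample = """\
clus1-01  mgmt     0        1208     45       -         offline
clus1-01  vldb     32       32       525821   clus1-01  master
clus1-02  mgmt     0        1208     45       -         offline
clus1-02  vldb     32       32       525821   clus1-01  secondary
"""

print(offline_units(sample))
```

If mgmt is the only unit reported, and it is listed for every node, the capture matches the pattern described in this article. Because the application flips between offline and online, a single capture may need to be repeated.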

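  • Because the other cluster applications remain online, data continues to be served. While mgmt is offline, however, a node cannot see the state of aggregates on other nodes (aggr show reports those aggregates with a state of unknown) and may report messages such as: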

Info: Node clus1-04 that hosts aggregate aggr1 is offline

  • The MGWD log shows SQL insert errors on sp_cap_rdb:
[kern_mgwd:info:2136] 0x828839500: SQL error: "INSERT INTO sp_cap_rdb(rowid, _epoch, _tid, [node], [nodeid], [id], [version]) VALUES (-350277020502054092, 828, 141, 'clus1-04', 'xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx', 33, 2);" UNIQUE constraint failed: sp_cap_rdb.node, sp_cap_rdb.id(19)
[kern_mgwd:info:2136] 0x828839500: 0: ERR: SQL_CONTEXT: execute_sql:src/sql_context.cc:836 SQL: failed on connection 0x81efa7308: UNIQUE constraint failed: sp_cap_rdb.node, sp_cap_rdb.id(19), txn: 'saveTxnChanges:sp_cap_rdb create',active_connection: 0x81efa7308, active_thread: 0x828839500, active_label: 'saveTxnChanges:sp_cap_rdb create', stmt: "INSERT INTO sp_cap_rdb(rowid, _epoch, _tid, [node], [nodeid], [id], [version]) VALUES (-350277020502054092, 828, 141, 'clus1-04', 'xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx', 33, 2);"
[kern_mgwd:info:2136] E [src/rdb/sql_local_unit.cc 5116 (0x828839500)]: saveTxnChanges: failed to execute SQL: 'INSERT INTO sp_cap_rdb(rowid, _epoch, _tid, [node], [nodeid], [id], [version]) VALUES (-350277020502054092, 828, 141, 'clus1-04', 'xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx', 33, 2);'.
[kern_mgwd:info:2136] W [src/rdb/sql_local_unit.cc 5288 (0x828839500)]: saveTxnChanges: abandoning due to INTERNAL_ERROR.
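
To check whether a collected MGWD log contains this signature, the lines can be scanned for the UNIQUE constraint failure on sp_cap_rdb and the affected node and id values pulled out of the INSERT statement. The following is a minimal Python sketch under stated assumptions: the helper name failed_inserts is hypothetical, the log lines are assumed to be already available as strings, and the message layout is assumed to match the excerpt above.

```python
import re

# Text that marks the failure, copied from the log excerpt in this article.
MARKER = "UNIQUE constraint failed: sp_cap_rdb"

# Captures the column list and the value list of the failing INSERT.
INSERT_RE = re.compile(
    r"INSERT INTO sp_cap_rdb\((?P<cols>[^)]*)\) VALUES \((?P<vals>[^)]*)\)"
)


def failed_inserts(log_lines):
    """Return the set of (node, id) pairs from failed sp_cap_rdb inserts."""
    hits = set()
    for line in log_lines:
        if MARKER not in line:
            continue
        match = INSERT_RE.search(line)
        if not match:
            continue
        # Column names are bracketed ([node]); string values are quoted.
        cols = [c.strip(" []") for c in match.group("cols").split(",")]
        vals = [v.strip(" '") for v in match.group("vals").split(",")]
        row = dict(zip(cols, vals))
        hits.add((row.get("node"), row.get("id")))
    return hits


sample = [
    '[kern_mgwd:info:2136] 0x828839500: SQL error: "INSERT INTO sp_cap_rdb('
    "rowid, _epoch, _tid, [node], [nodeid], [id], [version]) VALUES "
    "(-350277020502054092, 828, 141, 'clus1-04', "
    "'xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx', 33, 2);\" "
    "UNIQUE constraint failed: sp_cap_rdb.node, sp_cap_rdb.id(19)",
]

print(failed_inserts(sample))
```

The sketch returns a set, so the same insert reported on several log lines is counted once. A non-empty result only indicates that the log matches the symptom shown here; it does not identify the cause.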
