Snapshots on a FlexGroup are lost after a power outage, without any manual deletion attempt
Applies to
- ONTAP 9 clusters with two or more nodes
- FlexGroup volumes whose constituents span one HA pair
- FlexGroup snapshots
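Issue
- Both nodes of one HA pair:
  - Own constituents of the same FlexGroup.
  - Reboot unexpectedly at the same time without a clean shutdown (for example, because of a multi-disk panic or a power outage).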
- After the reboot, one or more FlexGroup snapshots are either missing entirely or are listed with - in the Size, Total% and Used% columns of snapshot show.
If only some of the constituents have lost the corresponding FlexGroup snapshot, the output looks like this:
::>set adv
Warning: These advanced commands are potentially dangerous; use them only when directed to do so by NetApp personnel.
Do you want to continue? {y|n}: y
::*>vol show -vserver svm1 -volume MyFlexgroup1 -fields is-flexgroup
vserver volume is-flexgroup
------- ------------ ------------
svm1 MyFlexgroup1 true
::*>volume snapshot show -vserver svm1 -volume MyFlexgroup1
---Blocks---
Vserver Volume Snapshot Size Total% Used%
-------- -------- ------------------------------------- -------- ------ -----
svm1 MyFlexgroup1
MySnapshot1 - - -
hourly.2024-03-11_0905 360KB 0% 36%
2 entries were displayed.
1 entry was acted on.
::*>node run -node MyCluster-01 -command snap status MyFlexgroup1__0001
Node: MyCluster-01
Volume MyFlexgroup1__0001
snapid status date ownblks release fsRev name
------ ------ ------------ ------- ------- ----- --------
2 complete Mar 11 09:05 47 9.7 35092 hourly.2024-03-11_0905
1 complete Mar 11 09:00 47 9.7 35092 MySnapshot1
::*>node run -node MyCluster-02 -command snap status MyFlexgroup1__0002
Node: MyCluster-02
Volume MyFlexgroup1__0002
snapid status date ownblks release fsRev name
------ ------ ------------ ------- ------- ----- -------
2 complete Mar 11 09:05 47 9.7 35092 hourly.2024-03-11_0905
Note: MySnapshot1 is missing on constituent MyFlexgroup1__0002.
::*>snapshot show -vserver svm1 -volume MyFlexgroup1 -snapshot MySnapshot1 -fields state
vserver volume snapshot state
------- ------------ ----------- -----
svm1 MyFlexgroup1 MySnapshot1 unknown
- If too many snapshots were deleted, the affected nodes may panic after the reboot:
Panic_Message: timeout table full in SK process snap_lopri_work on release 9.11.1P8
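
The per-constituent comparison shown in the snap status output above can be automated offline. The following is a minimal sketch in Python, not a NetApp tool: it assumes the snap status text has already been captured for each constituent and that its rows have the seven-column layout seen in this article (snapid, status, date, ownblks, release, fsRev, name). The function names snapshot_names and missing_by_constituent and the SAMPLE dictionary are hypothetical and exist only for illustration.

```python
import re

# One data row of `snap status` output as it appears in this article, e.g.
# "2 complete Mar 11 09:05 47 9.7 35092 hourly.2024-03-11_0905".
# The column layout is an assumption based on that sample only.
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+(\w{3}\s+\d+\s+\d{2}:\d{2})"
    r"\s+(\d+)\s+(\S+)\s+(\d+)\s+(\S+)\s*$"
)


def snapshot_names(snap_status_output):
    """Return the set of snapshot names found in one constituent's output."""
    names = set()
    for line in snap_status_output.splitlines():
        match = ROW.match(line)
        if match:  # header, separator and "Node:" lines do not match
            names.add(match.group(7))
    return names


def missing_by_constituent(outputs):
    """Map each constituent to the snapshots that exist on other
    constituents but not on it. Constituents missing nothing are omitted."""
    per_constituent = {c: snapshot_names(text) for c, text in outputs.items()}
    all_names = set().union(*per_constituent.values())
    return {
        c: sorted(all_names - names)
        for c, names in per_constituent.items()
        if all_names - names
    }


# Captured output copied from the example in this article.
SAMPLE = {
    "MyFlexgroup1__0001": """\
snapid status date ownblks release fsRev name
------ ------ ------------ ------- ------- ----- --------
2 complete Mar 11 09:05 47 9.7 35092 hourly.2024-03-11_0905
1 complete Mar 11 09:00 47 9.7 35092 MySnapshot1
""",
    "MyFlexgroup1__0002": """\
snapid status date ownblks release fsRev name
------ ------ ------------ ------- ------- ----- -------
2 complete Mar 11 09:05 47 9.7 35092 hourly.2024-03-11_0905
""",
}

if __name__ == "__main__":
    for constituent, missing in missing_by_constituent(SAMPLE).items():
        print(f"{constituent} is missing: {', '.join(missing)}")
```

With the sample data above, the script reports MySnapshot1 as missing on MyFlexgroup1__0002, matching the note in the example. A snapshot that was lost on every constituent cannot be detected this way, because no remaining constituent still lists it.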