Aggregate mount stalls and a software panic may occur during an ONTAP update
Applies to
- ONTAP 9
- Panic
Issue
- During an ONTAP update, an aggregate mount stalls, and a software panic may occur during takeover
Cluster1::storage failover> aggr show -state !online
Aggregate     Size Available Used% State    #Vols Nodes            RAID Status
--------- -------- --------- ----- ------- ------ ---------------- ------------
aggr1      336.1TB   325.1TB    3% mounting     0 Cluster1_A       raid_tec,normal
- The ONTAP update encounters the error "Takeover failed":
Cluster1::> cluster image show-update-progress
                                             Estimated        Elapsed
Update Phase         Status                   Duration       Duration
-------------------- ----------------- ------------------------------
Pre-update checks    completed                00:10:00       00:01:14
ONTAP updates        paused-on-error          01:32:00       00:10:05
Details:
Node name            Status            Status Description
-------------------- ----------------- --------------------------------------
Cluster1_A           waiting
Cluster1_B           failed            Error: Takeover failed.
                                       Action: Use the "storage failover show-takeover" command to view the
                                       cause of takeover failure and the suggested corrective actions.
                                       When all issues are resolved, use the "cluster image resume-update" command
- A panic similar to the following may be observed:
Mon May 15 05:01:39 -0700 [Cluster1_B: cf_main: sk.panic:alert]: Panic String: Failover Monitor: unable to transit - takeover process is hung (wafl) in SK process cf_main on release 9.10.1P8 (C)
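The symptoms above can be checked against saved CLI output and EMS log text with a short script. The following is only a minimal sketch, not a NetApp tool: the function names `non_online_aggregates` and `takeover_hung_panics` are hypothetical, and both regular expressions are assumptions derived solely from the sample output shown in this article, so they may need adjusting for other ONTAP releases or output layouts.

```python
import re

# Assumed row layout, taken from the sample "aggr show" output in this article:
# name, size, available, used%, state, #vols, node, RAID status.
AGGR_ROW = re.compile(
    r"^(?P<aggr>\S+)\s+\S+\s+\S+\s+\d+%\s+(?P<state>\S+)\s+\d+\s+(?P<node>\S+)"
)

# Assumed EMS line layout, taken from the sample sk.panic message in this article.
PANIC = re.compile(
    r"\[(?P<node>[^:\]]+):.*sk\.panic:alert\]: "
    r"Panic String: (?P<msg>.*takeover process is hung.*)"
)


def non_online_aggregates(text):
    """Return (aggregate, state, node) for each data row whose state is not 'online'."""
    found = []
    for line in text.splitlines():
        match = AGGR_ROW.match(line.strip())
        if match and match.group("state") != "online":
            found.append(
                (match.group("aggr"), match.group("state"), match.group("node"))
            )
    return found


def takeover_hung_panics(text):
    """Return (node, panic string) for each sk.panic line reporting a hung takeover."""
    found = []
    for line in text.splitlines():
        match = PANIC.search(line)
        if match:
            found.append((match.group("node"), match.group("msg")))
    return found
```

Passing the captured `aggr show` output to `non_online_aggregates` lists aggregates that are stuck in a state such as `mounting`, and passing EMS log text to `takeover_hung_panics` lists the nodes that logged the hung-takeover panic. Header and separator lines are skipped because they do not match the assumed row pattern.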