
SHUTDOWN PENDING (degraded mode) critical - AutoSupport message

Applies to

  • ONTAP 9
  • callhome.shutdown.pending
  • monitor.brokenDisk
  • HA Group Notification from node_name (SHUTDOWN PENDING (degraded mode)) ALERT

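Event summary

This message appears when a disk drive has failed and no suitable spare disk is available for reconstruction.

  • To protect data, the system enters degraded mode.
  • If the system runs in degraded mode for the configured interval, it halts automatically to prevent a double disk drive failure and possible data loss.
  • The default timeout is typically 24 hours (the raid.timeout option).
  • If a spare drive becomes available while the system is running in degraded mode, the system immediately starts reconstructing the failed drive.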
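The timing described above can be sketched as a small calculation. This is a minimal illustration, not ONTAP behavior: the function names are hypothetical, the 24-hour value is only the usual default mentioned above, and the actual halt time on a given system depends on its configured timeout.

```python
from datetime import datetime, timedelta

# Assumption: 24 hours is the usual default; the real value is configurable.
DEFAULT_TIMEOUT_HOURS = 24


def projected_halt_time(degraded_since, timeout_hours=DEFAULT_TIMEOUT_HOURS):
    """Return the time at which the degraded-mode timeout would expire."""
    return degraded_since + timedelta(hours=timeout_hours)


def hours_remaining(degraded_since, now, timeout_hours=DEFAULT_TIMEOUT_HOURS):
    """Return the hours left before the timeout expires, never below zero."""
    remaining = projected_halt_time(degraded_since, timeout_hours) - now
    return max(remaining.total_seconds() / 3600, 0.0)


if __name__ == "__main__":
    start = datetime(2024, 1, 1, 8, 0)
    print(projected_halt_time(start))
    print(hours_remaining(start, datetime(2024, 1, 1, 20, 0)))
```

A calculation like this is only useful for planning a replacement window; the authoritative countdown is the one reported in the event log.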

Validate

Event log

event log show -severity * -message-name callhome*

[node1: statd: callhome.shutdown.pending:alert]: Call home for SHUTDOWN PENDING (degraded mode)

event log show -severity * -message-name monitor.brokenDisk*

[node1: statd: monitor.brokenDisk.notice:info]: When two disks are broken in raid_dp volume, the system shuts down automatically every 24 hours to encourage you to replace the disk. If you reboot the system it will run for another 24 hours before shutting down. (The 24 hour timeout may be increased by altering the "raid.timeout" value using the "options" command.)

[node1: statd: monitor.shutdown.brokenDisk.pending:notice]: two data disks in RAID group "/aggregate_name/plex0/rg0" are broken. Halting system in 24 hours.
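When these messages are collected into a text file, a short script can pick out the relevant events. The sketch below assumes the bracketed layout seen in the sample messages above ([node: process: event.name:severity]: text); the function names and the regular expression are illustrative and are not part of any NetApp tool.

```python
import re

# Pattern inferred from the sample messages above; an assumption, not a spec.
EMS_RE = re.compile(
    r"\[(?P<node>[^:\]]+): (?P<process>[^:\]]+): "
    r"(?P<event>[\w.]+):(?P<severity>\w+)\]: (?P<text>.*)"
)

# Event names taken from the messages shown in this article.
EVENTS_OF_INTEREST = {
    "callhome.shutdown.pending",
    "monitor.brokenDisk.notice",
    "monitor.shutdown.brokenDisk.pending",
}


def parse_ems_line(line):
    """Split one event log line into its fields, or return None if it does not match."""
    match = EMS_RE.search(line)
    return match.groupdict() if match else None


def is_shutdown_pending(line):
    """Return True if the line is one of the degraded-mode events listed above."""
    parsed = parse_ems_line(line)
    return parsed is not None and parsed["event"] in EVENTS_OF_INTEREST


if __name__ == "__main__":
    sample = ("[node1: statd: callhome.shutdown.pending:alert]: "
              "Call home for SHUTDOWN PENDING (degraded mode)")
    print(parse_ems_line(sample))
```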

Command line

To verify the aggregate state, run storage aggregate show-status:

RAID group /aggregate_name/plex0/rg1 (double degraded, block checksums)

RAID Disk Device   HA  SHELF BAY CHAN Pool Type RPM   Used (MB/blks)     Phys (MB/blks)
--------- ------   ------------- ---- ---- ---- ----- --------------     --------------
dparity   0b.07.12 0b  7     12  SA:B 0    SAS  10000 1713523/3509295616 1716957/3516328368
parity    0b.07.13 0b  7     13  SA:B 0    SAS  10000 1713523/3509295616 1716957/3516328368
data      FAILED   N/A                                1713523/ -
data      0b.07.15 0b  7     15  SA:B 0    SAS  10000 1713523/3509295616 1716957/3516328368
data      FAILED   N/A                                1713523/ -
data      0b.07.21 0b  7     21  SA:B 0    SAS  10000 1713523/3509295616 1716957/3516328368
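For aggregates with many RAID groups, counting the FAILED rows by hand is error-prone. The following sketch tallies them per RAID group from saved command output. It assumes one disk per line, a "RAID group <path> (<state>, ...)" header as in the sample above, and FAILED in the Device column; the function name is hypothetical.

```python
import re

# Header layout inferred from the sample output above; treat it as an assumption.
GROUP_RE = re.compile(r"RAID group (?P<path>\S+) \((?P<state>[^,)]+)")


def failed_disks_per_group(output):
    """Map each RAID group path to its reported state and number of FAILED rows."""
    groups = {}
    current = None
    for line in output.splitlines():
        header = GROUP_RE.search(line)
        if header:
            current = header.group("path")
            groups[current] = {"state": header.group("state"), "failed": 0}
            continue
        tokens = line.split()
        # A failed disk row shows FAILED where the device name would be.
        if current and len(tokens) >= 2 and tokens[1] == "FAILED":
            groups[current]["failed"] += 1
    return groups


if __name__ == "__main__":
    sample = """RAID group /aggregate_name/plex0/rg1 (double degraded, block checksums)
dparity 0b.07.12 0b 7 12 SA:B 0 SAS 10000 1713523/3509295616 1716957/3516328368
data FAILED N/A 1713523/ -
data FAILED N/A 1713523/ -
"""
    print(failed_disks_per_group(sample))
```

Two FAILED rows in a group reported as double degraded match the situation described in the monitor.shutdown.brokenDisk.pending message.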

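To verify the failover state, run storage failover show and check whether the aggregate containing the disks that need reconstruction or zeroing is in a partial giveback state: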

storage failover show
                              Takeover
Node             Partner        Possible State Description
--------------   -------------- -------- -------------------------------------
Node-1           Node-2      true     Connected to Node-2, Partial giveback
Node-2           Node-1      true     Connected to Node-1.
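The same check can be scripted against saved output. The sketch below simply looks for the phrase "Partial giveback" in each row and reports the first column as the node name. It assumes the column layout shown above, and the function name is hypothetical.

```python
def nodes_in_partial_giveback(output):
    """Return the node names whose state description mentions a partial giveback."""
    nodes = []
    for line in output.splitlines():
        if "partial giveback" in line.lower():
            tokens = line.split()
            if tokens:
                # The first column of each data row is the node name.
                nodes.append(tokens[0])
    return nodes


if __name__ == "__main__":
    sample = """Node             Partner        Possible State Description
--------------   -------------- -------- -------------------------------------
Node-1           Node-2      true     Connected to Node-2, Partial giveback
Node-2           Node-1      true     Connected to Node-1.
"""
    print(nodes_in_partial_giveback(sample))
```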

 

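Resolution

  1. If the node is in a partial giveback state, complete the giveback. See the knowledge base article on disks not reconstructing or zeroing while in a partial giveback state.
  2. Replace all failed drives. See the knowledge base article on the "Part Status - Disk Failed - AutoSupport" message.

Note: If you need assistance, contact NetApp Technical Support and reference this article.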

 
