After a shutdown caused by bug 1343620, both down nodes are unable to boot
Applies to
- ONTAP 9
Issue
- During a BMC update, the controller becomes unresponsive and reboots, matching the symptoms of bug 1343620
- Before the panic, the Active IQ AutoSupport alerts include:
HA Group Notification (SP HBT MISSED) NOTICE
HA Group Notification (SP HBT STOPPED) ALERT
- Both nodes report the following shutdown event in EMS:
[cluster-01: env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes.
- Logging in to the BMC system console or a serial console connection shows one node stuck in a boot loop:
PANIC: NVRAM contents are invalid...
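- The partner node panicked during shutdown and cannot boot. Because the partner node had taken over and then shut itself down, boot stops at:
Waiting for reservations to clear
- Partner panic string: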
Shutdown taking longer than 930 seconds in process nodewatchdog on release 9.10.1P4 (C)
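
The console and EMS strings listed above can be checked for mechanically when reviewing saved log output. The following is a minimal sketch, not a NetApp tool: the function name `match_symptoms`, the signature labels, and the assumption that the logs are available as plain text lines are all hypothetical. Only the matched strings themselves come from this article.

```python
import re

# Signature strings copied from the symptoms in this article.
# The dictionary keys are hypothetical labels chosen for this sketch.
SIGNATURES = {
    "sp_heartbeat_shutdown": re.compile(r"sp\.ipmi\.lost\.shutdown"),
    "nvram_invalid": re.compile(r"PANIC: NVRAM contents are invalid"),
    "waiting_reservations": re.compile(r"Waiting for reservations to clear"),
    "nodewatchdog_panic": re.compile(
        r"Shutdown taking longer than \d+ seconds in process nodewatchdog"
    ),
}


def match_symptoms(lines):
    """Return the set of signature labels found in the given log lines."""
    found = set()
    for line in lines:
        for name, pattern in SIGNATURES.items():
            if pattern.search(line):
                found.add(name)
    return found


if __name__ == "__main__":
    # Sample lines taken from the symptoms above; real input would be
    # saved console or EMS output (assumed to be plain text).
    sample = [
        "PANIC: NVRAM contents are invalid...",
        "Waiting for reservations to clear",
    ]
    print(sorted(match_symptoms(sample)))
```

Finding all four signatures across the two nodes is consistent with the scenario described here, but it does not confirm the root cause on its own.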