BMC "心跳停止"事件导致节点使用 BMC 13.12 重新启动
适用于
- AFF-A400
- ONTAP 9
- BMC Firmware 13.12
问题
由于 BMC 心跳丢失,节点意外重新启动:
[node-01: spmgrd: sp.heartbeat.stopped:info]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.[node-01: spmgrd: sp.heartbeat.stopped:info]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 600 seconds.[node-01: spmgrd: callhome.sp.hbt.missed:notice]: Call home for SP HBT MISSED[node-01: spmgrd: callhome.sp.hbt.stopped:alert]: Call home for SP HBT STOPPED[node-01: env_mgr: sp.ipmi.lost.shutdown:EMERGENCY]: SP heartbeat stopped and cannot be recovered. To prevent hardware damage and data loss, the system will shut down in 10 minutes.[node-01: env_mgr: monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (System reboot to recover the BMC)