BMC reboots due to Watchdog_Reboot, but the node does not panic on AFF A400
Applies to
- AFF A400
- SP/BMC 13.10P1, 13.11, and 13.11P1
Issue
- The BMC of node01 reboots randomly with HW watchdog reboot events, but the node itself does not panic or reboot:
b99 | 06/23/2024 | 10:22:55 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
bb0 | 07/04/2024 | 20:32:46 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
bc4 | 07/13/2024 | 13:59:37 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
bf9 | 08/10/2024 | 18:23:04 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
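To confirm that the reboots recur at irregular intervals, the event lines above can be tallied with a short script. This is a minimal sketch, assuming the events are available as pipe-delimited text with six fields and MM/DD/YYYY dates, as in the excerpt; the function name `parse_watchdog_events` is hypothetical and not part of any NetApp tool.

```python
from datetime import datetime

# Sample copied from the event lines above; in practice, read the saved event log text.
SEL_SAMPLE = """\
b99 | 06/23/2024 | 10:22:55 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
bb0 | 07/04/2024 | 20:32:46 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
bc4 | 07/13/2024 | 13:59:37 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
bf9 | 08/10/2024 | 18:23:04 | Watchdog_Reboot #0xbc | HW watchdog reboot | Asserted
"""


def parse_watchdog_events(text):
    """Return (record_id, timestamp) for each asserted Watchdog_Reboot line."""
    events = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) != 6:
            continue  # skip lines that do not match the assumed six-field layout
        record_id, date, time, sensor, _description, state = fields
        if not sensor.startswith("Watchdog_Reboot") or state != "Asserted":
            continue
        stamp = datetime.strptime(f"{date} {time}", "%m/%d/%Y %H:%M:%S")
        events.append((record_id, stamp))
    return events


events = parse_watchdog_events(SEL_SAMPLE)
# Whole days elapsed between consecutive watchdog reboots.
gaps = [(later[1] - earlier[1]).days for earlier, later in zip(events, events[1:])]
print(len(events), "watchdog reboots; gaps in days:", gaps)
```

For the four events shown, the gaps are 11, 8, and 28 days, which is consistent with the reboots being random rather than periodic.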
- BMC watchdog events in boot_time.log:
IPMI_Main.c main start 10.59 7.84
IPMI_Main.c main after sync time 14.82 12.92 Wed Feb 24 13:29:00 GMT 2021
BMC init unknown:Wed Feb 24 13:29:10 GMT 2021
GPIO boot : Primary
Physical slot : #1
Primary env : active:#1 inactive:#1
Last boot error : HW watchdog timeout happened last time! Try to update your inactive flash again!
BMC init 3 6:Wed Feb 24 13:41:21 GMT 2021
GPIO boot : Primary
Physical slot : #1
Primary env : active:#0 inactive:#0
Last boot error : HW watchdog timeout happened last time! Try to update your inactive flash again!
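The same check can be done on boot_time.log by listing every BMC boot whose "Last boot error" line reports a hardware watchdog timeout. The following is a sketch only, assuming each boot record starts with a line beginning "BMC init" and contains a "Last boot error" line, as in the excerpt above; `find_watchdog_boots` is a hypothetical helper name.

```python
# Sample copied from the boot_time.log excerpt above.
BOOT_LOG_SAMPLE = """\
BMC init unknown:Wed Feb 24 13:29:10 GMT 2021
GPIO boot : Primary
Physical slot : #1
Primary env : active:#1 inactive:#1
Last boot error : HW watchdog timeout happened last time! Try to update your inactive flash again!
BMC init 3 6:Wed Feb 24 13:41:21 GMT 2021
GPIO boot : Primary
Physical slot : #1
Primary env : active:#0 inactive:#0
Last boot error : HW watchdog timeout happened last time! Try to update your inactive flash again!
"""


def find_watchdog_boots(text):
    """Return the 'BMC init' line of each boot that followed a HW watchdog timeout."""
    hits = []
    current_boot = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if line.startswith("BMC init"):
            current_boot = line  # assumed start of a new boot record
        elif line.startswith("Last boot error") and "HW watchdog timeout" in line:
            hits.append(current_boot)
    return hits


watchdog_boots = find_watchdog_boots(BOOT_LOG_SAMPLE)
for boot in watchdog_boots:
    print(boot)
```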