L2 watchdog resets on AFF A250/C250, ASA A250/C250, and FAS500f
Applies to
BMC 15.x - AFF A250, AFF C250, ASA A250, ASA C250, FAS500f
Issue
- The node reboots unexpectedly with the panic below:
watchdog nmi on cpu 2, hang cpu is -1 in process idle: cpu2
208 |XXX | 12:57:16 | Watchdog 2 #0x0f | Timer interrupt | Asserted
- The node does not come back up after an unexpected shutdown, and the BMC logs on the affected node show the following:
Record 402: Thu May 05 06:20:35.070000 2022 [ASUP.notice]: First notification email | (REBOOT (abnormal)) WARNING | Send failed
Record 403: Thu May 05 06:20:40.640000 2022 [IPMI.notice]: 0076 | 02 | EVT: 6fc302ff | System_Watchdog | Assertion Event, "Power cycle"
Record 404: Thu May 05 06:20:40.640000 2022 [IPMI Event.critical]: L2 watchdog timeout power cycle
- If the node does reboot, the following errors appear in the EMS log file:
Thu May 05 15:33:43 +0800 [netapp: splog_main: mgr.boot.reason_abnormal:EMERGENCY]: System rebooted due to a watchdog reset.
Thu May 05 15:33:43 +0800 [netapp: splog_main: callhome.reboot.watchdog:alert]: Call home for REBOOT (watchdog reset)
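
When checking collected log text for this issue, the signatures quoted above can be matched with a small script. The following is a minimal sketch, not a NetApp tool: the function name `find_watchdog_signatures` and the signature labels are hypothetical, and the patterns are taken only from the excerpts shown in this article, so other releases may format these messages differently.

```python
import re

# Patterns copied from the log excerpts in this article.
# The dictionary keys are illustrative labels, not official event names.
SIGNATURES = {
    "panic_watchdog_nmi": re.compile(r"watchdog nmi on cpu \d+"),
    "bmc_system_watchdog": re.compile(r"System_Watchdog \| Assertion Event"),
    "bmc_l2_watchdog": re.compile(r"L2 watchdog timeout power cycle"),
    "ems_abnormal_reboot": re.compile(r"mgr\.boot\.reason_abnormal.*watchdog reset"),
    "ems_callhome_watchdog": re.compile(r"callhome\.reboot\.watchdog"),
}


def find_watchdog_signatures(lines):
    """Return a dict mapping each matched signature label to its matching lines."""
    hits = {}
    for line in lines:
        for label, pattern in SIGNATURES.items():
            if pattern.search(line):
                hits.setdefault(label, []).append(line.strip())
    return hits


if __name__ == "__main__":
    # Sample input: lines quoted verbatim from the excerpts above.
    sample = [
        'Record 403: Thu May 05 06:20:40.640000 2022 [IPMI.notice]: 0076 | 02 | '
        'EVT: 6fc302ff | System_Watchdog | Assertion Event, "Power cycle"',
        "Record 404: Thu May 05 06:20:40.640000 2022 [IPMI Event.critical]: "
        "L2 watchdog timeout power cycle",
        "Thu May 05 15:33:43 +0800 [netapp: splog_main: "
        "mgr.boot.reason_abnormal:EMERGENCY]: System rebooted due to a watchdog reset.",
    ]
    for label, matched in find_watchdog_signatures(sample).items():
        print(f"{label}: {len(matched)} line(s)")
```

A match only indicates that the log text resembles the symptoms described here; it does not confirm the root cause.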