处理FAS2520/FAS2552/FAS2554上的L2监视器重置
适用场景
- FAS2520 / FAS2552 / FAS2554
问题描述
- 节点意外重新启动
- 节点在意外关闭后不会重新启动
受影响节点上的服务处理器日志显示以下内容:
Record 801: Sun Mar 06 15:09:20.924775 2021 [IPMI Event.critical]: L2 watchdog timeout hard reset
Record 802: Sun Mar 06 15:09:20.984259 2021 [Trap Event.critical]: hwassist l2_watchdog_reset (29)
Record 803: Sun Mar 06 15:09:23.000822 2021 [SP.critical]: Filer Reboot
- 如果节点重新启动、则在EMS日志文件中会显示以下错误
[cluster-01:mgr.boot.reason_abnormal:EMERGENCY]: System rebooted due to a watchdog reset.
- 如果节点无法重新启动、
system senors
则SP可能会显示传感器不可用(na
)或出现故障(Fault
)
Sensor Name | Current | Unit | Status | LCR | LNC | UNC | UCR
-----------------+------------+------------+------------+-----------+-----------+-----------+-----------
SYSTEM:
System_FW_Status | na | discrete | na | na | na | na | na
System_Watchdog | 0x0 | discrete | | na | na | na | na
Wrench_Port_Up | na | discrete | na | na | na | na | na
CONTROLLER_A:
PCM_Status | 0x0 | discrete | Fault | na | na | na | na
Attn_Sensor1 | 0x0 | discrete | Asserted | na | na | na | na
CPU-1_DTS_Temp | na | degrees C | na | na | na | -10.000 | 0.000
CPU-2_DTS_Temp | na | degrees C | na | na | na | -10.000 | 0.000
CPU0_PVCCP | na | Volts | na | 1.580 | 1.670 | 1.920 | 2.010
CPU1_PVCCP | na | Volts | na | 1.580 | 1.670 | 1.920 | 2.010