由于主板出现故障、电源状态严重
适用场景
- FAS8200
- AFF-A300
问题描述
- 由于未检测到检测到检测信号、发生接管。
[cf.fsm.takeover.noHeartbeat:ALERT]: Failover monitor: Takeover initiated after no heartbeat was detected from the partner node.
[cf.fm.takeoverComplete:notice]: (EMS parameters: token="XXXXXXXXXXX_13:44:29_2024:12:14" partner_node_uuid="XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX")
EMS
显示 PSU错误并频繁自动恢复。
[node1: env_mgr: monitor.chassisPowerSupply.degraded:notice]: Chassis power supply 1 is degraded: PSU1 Temperature is Unreadable
[node1: power_low_monitor: monitor.chassisPower.degraded:alert]: Chassis power is degraded: Power Supply Status Critical: PSU1.
[node1: power_low_monitor: callhome.chassis.power:error]: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU1.
[node1: monitor: monitor.globalStatus.critical:EMERGENCY]: Power Supply Status Critical: PSU1.
[node1: spsm_listener: sp.heartbeat.stopped:error]: Have not received a IPMI heartbeat from the Service Processor (SP) in last 20 seconds.
[node1: spsm_listener: sp.heartbeat.resumed:info]: Received IPMI heartbeat from the Service Processor (SP).
[node1: power_low_monitor: monitor.chassisPowerSupplies.ok:info]: Chassis power supplies OK.
[node1: monitor: monitor.globalStatus.ok:notice]: The system's global status is normal.
SP-LATEST-IPMI
显示多个板载传感器处于严重状态。
CPU0_Temp_Margin | na | degrees C | na | na | na | -5.000 | 0.000
PSU2_Fault | 0x0 | discrete | Asserted | na | na | na | na
Bat_1.5V | 1.765 | Volts | cr | 1.280 | 1.348 | 1.649 | 1.727
- 配对节点不具有相同的问题描述、并将PSU状态报告为正常。