Unexpected node reboot with a "Power on Reset(Normal Power Cycle)" event
Applies to
- AFF systems
- FAS systems
- Service Processor (SP)
- Baseboard Management Controller (BMC)
Issue
- Unexpected node reboot
- The SP or BMC logs a "Power Reset" event:
[IPMI.notice]: 9200 | c0 | OEM: ffff7000ff00 | ManufId: 150300 | SP Power Reset
[IPMI.notice]: 9300 | c0 | OEM: fcff70560000 | ManufId: 150300 | POS Register: Power on Reset(Normal Power Cycle)
or
[IPMI.notice]: 00d4 | c0 | OEM: ffff7000ff00 | ManufId: 150300 | BMC Power Reset
[IPMI.notice]: 00d5 | c0 | OEM: fcff70560000 | ManufId: 150300 | POS Register: Power on Reset(Normal Power Cycle)
- The partner node takes over because of missed heartbeats or lost communication:
[node_name: cf_main: cf.fsm.takeover.noHeartbeat:alert]: Failover monitor: Takeover initiated after no heartbeat was detected from the partner node.
or
cf_fastTimeout: cf.ic.heartBeatFailed:error]: HA interconnect: Heartbeat failed
or
cf_takeover: callhome.sfo.takeover:alert]: Call home for CONTROLLER TAKEOVER COMPLETE AUTOMATIC - Communication Error
- No other suspicious error messages are logged.
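
The symptoms above can be checked against saved log text with a small script. The following is only a minimal sketch, not a NetApp tool: the function names `find_symptoms` and `matches_issue` and the group labels are hypothetical, the match strings are copied from the sample messages in this article, and it assumes the SP/BMC event log and the EMS messages from both nodes have already been exported into one list of text lines. It cannot confirm the last symptom (that no other suspicious errors are present); that still needs a manual review.

```python
from typing import Dict, Iterable, List

# Substrings copied from the sample messages in this article.
# The group labels are illustrative names, not NetApp terminology.
SIGNATURES = {
    "power_reset": ("SP Power Reset", "BMC Power Reset"),
    "pos_register": ("POS Register: Power on Reset(Normal Power Cycle)",),
    "partner_takeover": (
        "cf.fsm.takeover.noHeartbeat",
        "cf.ic.heartBeatFailed",
        "callhome.sfo.takeover",
    ),
}


def find_symptoms(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Group the log lines that contain one of the known substrings."""
    hits: Dict[str, List[str]] = {label: [] for label in SIGNATURES}
    for line in lines:
        for label, needles in SIGNATURES.items():
            if any(needle in line for needle in needles):
                hits[label].append(line.strip())
    return hits


def matches_issue(hits: Dict[str, List[str]]) -> bool:
    """True only when every symptom group has at least one matching line."""
    return all(hits[label] for label in SIGNATURES)


if __name__ == "__main__":
    # Sample lines taken from this article; real input would come from
    # exported log files (how they are exported is outside this sketch).
    sample = [
        "[IPMI.notice]: 9200 | c0 | OEM: ffff7000ff00 | ManufId: 150300 | SP Power Reset",
        "[IPMI.notice]: 9300 | c0 | OEM: fcff70560000 | ManufId: 150300 | "
        "POS Register: Power on Reset(Normal Power Cycle)",
        "cf_fastTimeout: cf.ic.heartBeatFailed:error]: HA interconnect: Heartbeat failed",
    ]
    result = find_symptoms(sample)
    for label, matched in result.items():
        print(label, len(matched))
    print("matches issue:", matches_issue(result))
```

Plain substring matching is used instead of regular expressions because the message text contains parentheses and pipes that would otherwise need escaping.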