由于节点上的 CPU 灾难性错误而导致的系统接管
适用于
AFF-A90
问题
- 节点重新启动,没有任何死机字符串或错误消息
- BMC CLI 命令
bmc status -d显示CPU Catastrophic Error正在断言和取消断言。
root: eventfifod 446.567: 123(0x007b) : CPU Catastrophic Error asserted
root: eventfifod 446.567: 123(0x807b) : CPU Catastrophic Error de-asserted
root: eventfifod 447.367: 126(0x007e) : CPU Error Level 2 asserted
root: eventfifod 470.514: 126(0x807e) : CPU Error Level 2 de-asserted
root: eventfifod 472.981: 123(0x007b) : CPU Catastrophic Error asserted
root: eventfifod 472.981: 123(0x807b) : CPU Catastrophic Error de-asserted
root: eventfifod 762.422: 95(0x005f) : NMI Trigger to PCH asserted
root: eventfifod 762.425: 97(0x8061) : LVC3 CPU0 NMI asserted
root: eventfifod 762.425: 98(0xc062) : LVC3 CPU1 NMI asserted
root: eventfifod 762.425: 131(0x4083) : PCH NMI Request from BMC asserted