在发生多个 ECC 错误后,计算节点在 vCenter 中停留在无响应状态
适用场景
NetApp H700E
问题描述
- 节点启动
- 系统事件日志( SEL )中显示多个 ECC 错误( x 和 y 值取决于 DIMM 位置):
ID,Critical,<DATE TIME>,BIOS OEM(Memory Error),Failing DIMM: DIMM location (Correctable memory component found) (Px-DIMMy) - Assertion
ID,Critical,<DATE TIME>,BIOS OEM(Memory Error),(runtime) Failing DIMM: DIMM location. (Px-DIMMy) - Assertion
Id,Warning,<DATE TIME>,Processor(OEM),Configuration Error - Assertion
- 主机在 vCenter 上无响应