HCI 计算主机崩溃,并出现不可更正的内存错误
适用场景
- NetApp HCI计算节点
- VMware ESXi
问题描述
- ESXi主机在 中遇到PSOD (紫色诊断屏幕)
uncorrectable memory error for DIMM
- 可能的BMC系统事件日志(SEL):
(runtime) Failing DIMM: DIMM location. (PX-DIMMAX) - Assertion" and "Uncorrectable ECC @PX-DIMMAX(CPUX) - Assertion"
Memory(OEM) Uncorrectable ECC / other uncorrectable memory error @P2-DIMMC2(CPU2) - Assertion
[Memory Error] [Memory] Uncorrectable ECC(CPUX_BX) - Asserted"
BIOS OEM(Memory Error) Post package repair fail. (P2-DIMMC2) - Assertion
BIOS OEM(Memory Error) Memory signal is too marginal. (P2-DIMMC2) - Assertion
BIOS OEM(Memory Error) (runtime) Failing DIMM: DIMM location. (P1-DIMMA3) - Assertion