StorageGRID 设备 BMC 模块报告 MCERR 和 MCE 错误
适用场景
- NetApp StorageGRID SG6000
- NetApp StorageGRID SG1000
- NetApp StorageGRID SG100
问题描述
- StorageGRID 设备计算节点上的琥珀色指示灯亮起。
- BMC 模块事件报告
MCERR
和MCE
错误。BMC 事件可以在 BMC UI 中捕获,也可以通过 SSH 捕获 - StorageGRID 报告:
Appliance compute controller needs attention
A hardware fault has been detected in the compute controller of a StorageGRID appliance
或Unable to communicate with node
示例:
Nov/18/2020 10:05:14 [Warning] [Additional MCE Error] [OEM Record C2] ManufacturerID:001C4C, Extra Information : 0 MSCOD:0030 MCACOD:0E0F
Nov/18/2020 10:05:14 [Critical] [MCERR] [Processor] Correctable Error - Machine Check Error: Bank 12/ CPU 1/ Core 0 - Asserted