SG6000卡住并报告不可用的自检结果:0x83 0x00 -已断言
适用场景
NetApp StorageGRID 设备SG6000
问题描述
StorageGRID设备意外重新启动(
unexpected node reboot
警报)和/或停留在BIOS启动屏幕上。BMC日志报告:
[Critical] [NM Status] [Management Subsystem Health] Controller access degraded or unavailable Self-Test Result:0x83 0x00 - Asserted
[Information] [SPS FW Health] [OEM] Firmware Status - Restricted Mode Info - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Critical] [CATERR] [Processor] IERR - Asserted
[Critical] [CATERR] [Processor] Machine Check Exception (MCERR) - Asserted
[Information] [Extended PCIe Error] [OEM Record C0] ManufacturerID:001C4C/ VID:15B3/ DID:1015/ ErrorID 1:28/ SlotNo : 1-1
[Information] [Extended PCIe Error] [OEM Record C0] ManufacturerID:001C4C/ VID:15B3/ DID:1015/ ErrorID 1:22/ SlotNo : 1-1
[Critical] [PCIe Error] [Critical Interrupt] Bus Fatal (Bus1C/Dev0/Fun1) - Asserted
[Information] [Memory Error Dis] [Event Logging Disabled] Correctable Memory Error Logging Disabled - Asserted
[Critical] [Memory Error] [Memory] Correctable ECC Error Logging Limit Reached(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [Memory Error] [Memory] Parity Error(CPU0_F0) - Asserted
[Information] [NM Status] [Management Subsystem Health] Controller access degraded or unavailable - Deasserted
[Information] [Power Unit] [Power Unit] Power Off / Power Down - Deasserted
[Information] [SPS FW Health] [OEM] Firmware Status - Restricted Mode Info - Asserted
[Information] [Power Unit] [Power Unit] Power Off / Power Down - Asserted
[Warning] [Watchdog] [Watchdog 2] Power Cycle(Timer use at expiration: SMS/OS) - Asserted
[Critical] [NM Status] [Management Subsystem Health] Controller access degraded or unavailable Self-Test Result:0x83 0x00 - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Information] [SPS FW Health] [OEM] Firmware Status - Restricted Mode Info - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Warning] [SPS FW Health] [OEM] Firmware Status - UMA Operation Error - Asserted
[Information] [Extended PCIe Error] [OEM Record C0] ManufacturerID:001C4C/ VID:15B3/ DID:1015/ ErrorID 1:22/ SlotNo : 1-1
[Critical] [PCIe Error] [Critical Interrupt] Bus Uncorrectable Error (Bus1C/Dev0/Fun1) - Asserted