StorageGRID设备警报:由于硬件计算机检查异常、节点意外重新启动
适用场景
NetApp StorageGRID设备
问题描述
StorageGRID报告警报
Unexpected node reboot
下载支持包并验证
base-os-logs/run/mount-tmp/pge-actv-root/var/log/storagegrid_crash_dmesg.DATE.log.gz
IT报告时:[8526393.622416] Disabling lock debugging due to kernel taint
[8526393.627902] mce: [Hardware Error]: CPU 0: Machine Check Exception: 5 Bank 12: b200003f000100b3
[8526393.636639] mce: [Hardware Error]: RIP !INEXACT! 10:<ffffffff9e4cae04> {native_queued_spin_lock_slowpath+0x54/0x190}
[8526393.647283] mce: [Hardware Error]: TSC 43d23fe1832b9a2
[8526393.652650] mce: [Hardware Error]: PROCESSOR 0:306e4 TIME 1683024361 SOCKET 0 APIC 0 microcode 428
[8526393.661732] mce: [Hardware Error]: Run the above through 'mcelog --ascii'
[8526393.672067] mce: [Hardware Error]: Machine check: Processor context corrupt
[8526393.679162] Kernel panic - not syncing: Fatal machine check