跳转到主内容

Coming soon...New Support-Specific categorization of Knowledge Articles in the NetApp Knowledge Base site to improve navigation, searchability and your self-service journey.

AF-A250 关闭无法启动,并指示硬件出现故障

Views:
13
Visibility:
Public
Votes:
0
Category:
aff-series<a>崩溃</a><a>夹层卡</a><a>A250</a><a>2008863671</a>
Specialty:
hw
Last Updated:

适用场景

  • AFF-A250
  • ONTAP 9

问题描述

  • 节点关闭并显示以下崩溃消息:

PANIC: Uncorrectable Machine Check Error at CPU9. SKL_IIO Error: STATUS<0xbb80000000000e0b>(VALID,UC,EN,MISCV,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))MISC<0x0000000064000000>(UCR_BUS_LOG(100),UCR_DEVICE_LOG(0),UCR_FUNCTION_LOG(0),UCR_SEGMENT_LOG(0))IIO Machine Check from device(s):RPT(100,0,0):ErrSrcID(CorrSrc(0),UCorrSrc(0x6660)), PLX PCIE 9797 switch on Controller, Br[9797](102,12,0): Link down. . in process idle: cpu9 on release 9.8P3 (C) on Mon Aug 2 00:23:10 CEST 2021 version: 9.8P3: Sat Mar 27 04:59:49 EDT 2021

  • 诊断启动失败:

[   55.095414] mce: [Hardware Error]: CPU 0: Machine Check Exception: 5 Bank 6: bb80000000000e0b
[   55.104912] mce: [Hardware Error]: RIP !INEXACT! 10:<ffffffff81786e4f> 
[   55.112086] {mwait_idle+0x6f/0x160} mce: [Hardware Error]: TSC 49a9a022c2 MISC 64000000
[   55.121240] mce: [Hardware Error]: PROCESSOR 0:50654 TIME 1627869582 SOCKET 0 APIC 0 microcode 2006906
[   55.131605] mce: [Hardware Error]: Run the above through 'mcelog --ascii'
[   55.139166] mce: [Hardware Error]: Machine check: Processor context corrupt
[   55.146909] Kernel panic - not syncing: Fatal machine chec

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

Scan to view the article on your device