由于 FAS8300 上的 MiniSAS HBA 卡故障,发生崩溃
适用场景
FAS8300
问题描述
带有崩溃字符串的节点崩溃:
PANIC: Uncorrectable Machine Check Error at CPU8. SKL_IIO Error: STATUS<0xbb80000000000e0b>(VALID,UC,EN,MISCV,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))MISC<0x0000000017000000>(UCR_BUS_LOG(23),UCR_DEVICE_LOG(0),UCR_FUNCTION_LOG(0),UCR_SEGMENT_LOG(0))IIO Machine Check from device(s):RPT(23,0,0):ErrSrcID(CorrSrc(0),UCorrSrc(0x1700)), PCI Device 1000:d1 in slot 1 on Controller. in process idle: cpu8 on release 9.7P11 (C) on Fri Apr 16 12:08:16 JST 2021
在
SYSCONFIG-AC
中, MiniSAS HBA 卡位于控制器上的插槽 1 中:sysconfig: slot 1 OK: X2072A: 4x12G miniSAS Controler HBA