ONTAP集群节点意外重新启动
适用场景
- ONTAP 9
- FAS 8700
问题描述
- 此节点意外重新启动、并显示以下崩溃字符串:
示例:
Panic String: Uncorrectable Machine Check Error at CPU25. SKL_IIO Error: STATUS<0xbb80000000000e0b>(VALID,UC,EN,MISCV,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))MISC<0x0000000080000000>(UCR_BUS_LOG(128),UCR_DEVICE_LOG(0),UCR_FUNCTION_LOG(0),UCR_SEGMENT_LOG(0))IIO Machine Check from device(s):RPT(128,0,0):ErrSrcID(CorrSrc(0x8100),UCorrSrc(0x8250)), PLX PCIE 8733 switch on Controller, Br[8733](130,10,0): Link down. in process idle: cpu25 on release 9.13.1P8
- 执行完恢复操作后、插槽6中的Flash Cache卡在
sysconfig -a
输出中报告为缺失、并在sysconfig -ac
输出中标记为不受支持。
示例:
::> node run -node <node name> sysconfig -ac
sysconfig: Card in slot 6 (15-144D-A808-0) is not supported.
sysconfig -P
将显示 控制器中所有PCI设备的PCI层次结构。
::> node run -node <node_name> sysconfig -P
Br[2030](128,0,0): PCI Device 8086:2030 on Controller
LinkCap(MaxLkSp(3),MaxLkWd(4),ASPM(2),L0(3),L1(4),SurpDn,DLAct,Port(0))
LinkStatus(LkSp(3),LkWd(4),SClk,DLAct),
Br[8733](129,0,0): PLX PCIE 8733 switch on Controller
LinkCap(MaxLkSp(3),MaxLkWd(4),ASPM(2),L0(6),L1(2),Port(8))
LinkStatus(LkSp(3),LkWd(4)),
Br[8733](130,10,0): PLX PCIE 8733 switch on Controller
LinkCap(MaxLkSp(3),MaxLkWd(4),ASPM(2),L0(6),L1(2),SurpDn,DLAct,Port(10))
LinkStatus(LkSp(3),LkWd(4),DLAct),
Dv[a808](131,0,0) in slot 6: PCI Device 144d:a808 in slot 6 on Controller