40G/100G Ethernet Controller CX5 card is missing and causes a panic
Applies to
- AFF A400, C400
- AFF A800
- FAS9300, FAS8700
- X1148A: 40G/100G Ethernet Controller CX5
Issue
- The card is missing from the sysconfig -a output
- The card is reported as "not supported" in the sysconfig -ac output:
sysconfig: Card in slot 1 (1-15B3-1017-0) is not supported.
sysconfig: slot 3 OK: X1151A: 2p 100G Naples-100 Smart IO
sysconfig: slot 4 OK: X1148A: 40G/100G Ethernet Controller CX5
sysconfig: slot 5 OK: X1148A: 40G/100G Ethernet Controller CX5
- An error may be logged at boot:
Jul 24 07:08:29 [XXXX-01-XXXXX:netif.init.failed:ALERT]: Initialization of network interface e5a failed due to unexpected software error mlx5_core err=0xfffffff0:115.
- The card may trigger a panic:
Panic String: Uncorrectable Machine Check Error at CPU1. SKL_IIO Error: STATUS<0xbb80000000000e0b>(VALID,UC,EN,MISCV,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0),MCACOD(0xe0b))MISC<0x0000000017000000>(UCR_BUS_LOG(23),UCR_DEVICE_LOG(0),UCR_FUNCTION_LOG(0),UCR_SEGMENT_LOG(0))IIO Machine Check from device(s):RPT(23,0,0):ErrSrcID(CorrSrc(0),UCorrSrc(0x1a00)), Mellanox CX5 Ethernet in slot 1 on Controller, Mellanox CX5 Ethernet in slot 1 on Controller. in process idle: cpu1 on release 9.7P6 (C)
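The symptoms above can be checked in saved console or log text. The following is a minimal sketch, not a NetApp tool. It assumes the output is formatted exactly like the sample lines quoted in this article, and other ONTAP releases may format these lines differently. The function names parse_sysconfig_ac and failed_interfaces are hypothetical helpers introduced only for illustration.

```python
import re

# Patterns derived only from the sample lines quoted in this article (assumption:
# the real output on a given ONTAP release follows the same layout).
UNSUPPORTED_RE = re.compile(r"Card in slot (\d+) \(([^)]+)\) is not supported")
SLOT_OK_RE = re.compile(r"slot (\d+) OK: (\S+): (.+)")
NETIF_FAILED_RE = re.compile(
    r"netif\.init\.failed:\w+\]: Initialization of network interface (\S+) failed"
)


def parse_sysconfig_ac(text):
    """Return (unsupported, ok) parsed from saved 'sysconfig -ac' output.

    unsupported: list of (slot, card_id) tuples for cards flagged as not supported.
    ok: dict mapping slot number to (part_number, description).
    """
    unsupported = []
    ok = {}
    for line in text.splitlines():
        match = UNSUPPORTED_RE.search(line)
        if match:
            unsupported.append((int(match.group(1)), match.group(2)))
            continue
        match = SLOT_OK_RE.search(line)
        if match:
            ok[int(match.group(1))] = (match.group(2), match.group(3).strip())
    return unsupported, ok


def failed_interfaces(log_text):
    """Return interface names that appear in netif.init.failed messages."""
    return NETIF_FAILED_RE.findall(log_text)


SAMPLE_SYSCONFIG = """\
sysconfig: Card in slot 1 (1-15B3-1017-0) is not supported.
sysconfig: slot 3 OK: X1151A: 2p 100G Naples-100 Smart IO
sysconfig: slot 4 OK: X1148A: 40G/100G Ethernet Controller CX5
sysconfig: slot 5 OK: X1148A: 40G/100G Ethernet Controller CX5
"""

SAMPLE_EMS = (
    "Jul 24 07:08:29 [XXXX-01-XXXXX:netif.init.failed:ALERT]: Initialization of "
    "network interface e5a failed due to unexpected software error "
    "mlx5_core err=0xfffffff0:115."
)

if __name__ == "__main__":
    unsupported, ok = parse_sysconfig_ac(SAMPLE_SYSCONFIG)
    for slot, card_id in unsupported:
        print(f"slot {slot}: card {card_id} reported as not supported")
    for name in failed_interfaces(SAMPLE_EMS):
        print(f"interface {name}: initialization failed")
```

The sketch only flags the symptoms. It does not determine the cause, and the slot numbers and interface names it reports come straight from the text it is given.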