Blade goes into a faulty state on a Brocade switch because CRC errors with good EOF exceeded the threshold
Applies to
Brocade switch
Issue
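- The blade in slot 6 of the Brocade switch goes into a faulty state.
- A switch port on the chip/blade in slot 6 of the Brocade switch exceeds the CRC errors with good EOF threshold.
- The following events are reported in errdump on the Brocade switch: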
2024/11/15-02:27:19 (IST), [C4-1030], 12084732, SLOT 1 | CHASSIS | PORT 6/4, WARNING, Switch, S6,P-1(20): Internal CRC with good EOF errors exceeded threshold, tuning is required. current:0x19, last:0xf000000b thresh2:0x9000000a.
2024/11/15-02:27:19 (IST), [C4-1056], 12084733, SLOT 1 | CHASSIS, CRITICAL, Switch, Chip in Slot 6, Chip 0 getting faulted with reason 52.
2024/11/15-02:27:20 (IST), [EM-1134], 12084734, SLOT 1 | FFDC | CHASSIS, ERROR, Switch, Slot 6 set to faulty, rc=20015.
2024/11/15-02:27:20 (IST), [RAS-1001], 12084735, SLOT 1 | CHASSIS, INFO, Switch, First failure data capture (FFDC) event occurred.
2024/11/15-02:27:20 (IST), [MAPS-1002], 12084736, SLOT 1 | FID 128, ERROR, Switch, Blade 6, Condition=ALL_SLOTS(BLADE_STATE/NONE==FAULTY), Current Value:[BLADE_STATE, FAULTY], RuleName=defALL_SLOTSBLADE_STATE_FAULTY, Dashboard Category=Fru Health, Quiet Time=None.
2024/11/15-02:27:50 (IST), [MAPS-1021], 12084738, SLOT 1 | FID 128, WARNING, Switch, RuleName=defCHASSISFAULTY_BLADE_1, Condition=CHASSIS(FAULTY_BLADE/NONE>=1), Obj:Chassis [FAULTY_BLADE,1] has contributed to switch status MARGINAL.
2024/11/15-02:27:50 (IST), [MAPS-1020], 12084739, SLOT 1 | FID 128, WARNING, Switch, Switch wide status has changed from HEALTHY to MARGINAL.
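When a saved errdump capture is long, it can help to pull out only the CRITICAL and ERROR events. The following is a minimal Python sketch, not a vendor tool. It assumes each event sits on its own line in the comma-separated layout seen in the events above (timestamp, message ID, sequence number, flags, severity, switch name, message text). The names `EVENT_RE`, `parse_errdump`, and `SAMPLE` are hypothetical and used only for illustration.

```python
import re

# Assumed field layout, inferred only from the events quoted in this article:
# timestamp (tz), [MSG-ID], sequence, flags, SEVERITY, switch name, message text
EVENT_RE = re.compile(
    r"^(?P<timestamp>\d{4}/\d{2}/\d{2}-\d{2}:\d{2}:\d{2}) \((?P<tz>\w+)\), "
    r"\[(?P<msg_id>[A-Z0-9]+-\d+)\], (?P<seq>\d+), (?P<flags>[^,]*), "
    r"(?P<severity>[A-Z]+), (?P<switch>[^,]+), (?P<text>.*)$"
)


def parse_errdump(text):
    """Return one dict per line that matches the assumed event layout."""
    events = []
    for line in text.splitlines():
        match = EVENT_RE.match(line.strip())
        if match:
            events.append(match.groupdict())
    return events


# Three events copied from the errdump output in this article.
SAMPLE = """\
2024/11/15-02:27:19 (IST), [C4-1056], 12084733, SLOT 1 | CHASSIS, CRITICAL, Switch, Chip in Slot 6, Chip 0 getting faulted with reason 52.
2024/11/15-02:27:20 (IST), [EM-1134], 12084734, SLOT 1 | FFDC | CHASSIS, ERROR, Switch, Slot 6 set to faulty, rc=20015.
2024/11/15-02:27:20 (IST), [RAS-1001], 12084735, SLOT 1 | CHASSIS, INFO, Switch, First failure data capture (FFDC) event occurred.
"""

if __name__ == "__main__":
    for event in parse_errdump(SAMPLE):
        # Keep only the severities that indicate a fault.
        if event["severity"] in {"CRITICAL", "ERROR"}:
            print(event["timestamp"], event["msg_id"], event["text"])
```

Lines that do not match the assumed layout are skipped silently, so output from a different Fabric OS release may need an adjusted pattern.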
- slotshow -m reports the switch blade in slot 6 as FAULTY:
/fabos/cliexec/slotshow -m:
Slot  Blade Type  ID   Model Name  Status
--------------------------------------------------
   1  CP BLADE    175  CPX6        ENABLED
   2  CP BLADE    175  CPX6        ENABLED
   3  SW BLADE    178  FC32-48     ENABLED
   4  SW BLADE    178  FC32-48     ENABLED
   5  SW BLADE    178  FC32-48     ENABLED
   6  SW BLADE    178  FC32-48     FAULTY (21)
   7  CORE BLADE  177  CR32-8      ENABLED
   8  CORE BLADE  177  CR32-8      ENABLED
   9  SW BLADE    178  FC32-48     ENABLED
  10  SW BLADE    178  FC32-48     ENABLED
  11  SW BLADE    178  FC32-48     ENABLED
  12  SW BLADE    178  FC32-48     ENABLED
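To spot a faulted blade quickly in saved slotshow -m output, the rows can be filtered by status. The sketch below is illustrative only and assumes one blade per line with the columns shown above (slot, blade type, ID, model name, status). The names `ROW_RE`, `non_enabled_blades`, and `SAMPLE` are hypothetical, and rows without a numeric blade ID are not matched.

```python
import re

# Assumed row layout: slot, blade type (may contain spaces), numeric ID,
# model name (no spaces), status (may contain spaces, e.g. "FAULTY (21)").
ROW_RE = re.compile(
    r"^\s*(?P<slot>\d+)\s+(?P<blade_type>.+?)\s+(?P<blade_id>\d+)\s+"
    r"(?P<model>\S+)\s+(?P<status>.+?)\s*$"
)


def non_enabled_blades(slotshow_output):
    """Return (slot, model, status) for every blade whose status is not ENABLED."""
    rows = []
    for line in slotshow_output.splitlines():
        match = ROW_RE.match(line)
        if match and match.group("status") != "ENABLED":
            rows.append(
                (int(match.group("slot")), match.group("model"), match.group("status"))
            )
    return rows


# A few rows copied from the slotshow -m output in this article.
SAMPLE = """\
Slot  Blade Type  ID   Model Name  Status
--------------------------------------------------
   5  SW BLADE    178  FC32-48     ENABLED
   6  SW BLADE    178  FC32-48     FAULTY (21)
   7  CORE BLADE  177  CR32-8      ENABLED
"""

if __name__ == "__main__":
    for slot, model, status in non_enabled_blades(SAMPLE):
        print(f"slot {slot}: {model} is {status}")
```

The header and separator lines do not start with a slot number, so the pattern ignores them without special handling.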