在固件更新失败后定期出现 CriticalFanFruFaultAlert 警报
适用于
- ONTAP 9
- AFF-A400 / FAS8300 / FAS8700
- BMC 13.11
问题描述
FAN Fru 错误由一个控制器报告,在下面的示例中,节点 2:
- 事件日志中看到的警报:
Tue Sep 02 09:09:51 +0200 [cluster1-02: cphmd: hm.alert.raised:alert]: Alert Id = CriticalFanFruFaultAlert , Alerting Resource = 042352004520 raised by monitor chassis
Tue Sep 02 09:09:51 +0200 [cluster1-02: cphmd: hm.alert.raised:alert]: Alert Id = CriticalFanFruFaultAlert , Alerting Resource = 042352004524 raised by monitor chassis
system service processor show
命令显示其中一个节点 BMC 位于以下固件版本。Node Type Status Configured Version IP Address
------------- ---- ----------- ------------ --------- -------------------------
cluster1-01 - unknown - 13.11 -
cluster1-02 - unknown - 13.12 -
2 entries were displayed.
- 调查 EMS,观察到失败的固件更新:
Tue Sep 02 08:43:54 +0200 [cluster1-01: servprocd: sp.servprocd.upd.evts:debug]: params: {'reason': 'SP Firmware network update from 13.11P1 to 13.12 has been triggered.'}
Tue Sep 02 08:59:24 +0200 [cluster1-01: servprocd: sp.servprocd.upd.unexpt.evts:debug]: params: {'reason': 'BMC update - BMC Firmware update timed out.'}
Tue Sep 02 08:59:24 +0200 [cluster1-01: servprocd: sp.servprocd.upd.error:error]: SP update error: SP firmware update failure has been detected.
Tue Sep 02 09:18:08 +0200 [cluster1-01: servprocd: sp.servprocd.upd.error:error]: SP update error: SP Firmware network auto-update could not be scheduled.
Platform-Sensors
在通过 Active IQ 数字顾问的 AutoSupport 中,显示 FAN 报告为禁用:
传感器名称 |
传感器类型 |
传感器状态 |
SysFan1 F1 |
离散 |
禁用 |
SysFan1 F2 |
离散 |
禁用 |
SysFan2 F1 |
离散 |
禁用 |
SysFan2 F2 |
离散 |
禁用 |
SysFan3 F1 |
离散 |
禁用 |
SysFan3 F2 |
离散 |
禁用 |
SysFan4 F1 |
离散 |
禁用 |
SysFan4 F2 |
离散 |
禁用 |