Chassis Fan FRU Failed: multiple fans have failed, even after upgrading the SP/BMC
Applies to
- FAS2620
- FAS2650
- FAS2750
- FAS8200
- AFF A200
- AFF A220
- AFF A300
Issue
- EMS errors. Example:
[Cluster: env_mgr: monitor.fan.warning:notice]: multiple fans have failed. Replace it to avoid overheating
[Cluster: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Module B Expander Temp) is not readable.
[Cluster: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Module A Expander Temp) is not readable.
[Cluster: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 4 Temp) is not readable.
[Cluster: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 3 Temp) is not readable.
[Cluster: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 2 Temp) is not readable.
[Cluster: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Midplane 1 Temp) is not readable.
[Cluster: env_mgr: monitor.temp.unreadable:error]: The controller temperature (Ambient Temp) is not readable.
[Cluster: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: Multiple fans have failed
- High SP load reported in the event log. Example:
[SP.notice]: SP load is high: 2.62 2.37 2.04
[SP.notice]: SP load is high: 2.71 2.31 2.08
- Repeated env_mgr errors in the SP system log output. Example:
env_mgr[1356]: Error opening ses, err -1
env_mgr[1356]: cdev_reopen: error opening device 'ses', err -1.
env_mgr[1356]: cdevices_poll_pending: cdev(2001) re-open, state is 4.
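The three symptom groups above can be checked against saved log text with a short script. The following is a minimal sketch, not a supported tool: the signature names, the `count_signatures` and `matches_issue` helpers, and the rule that all three groups must be present are assumptions made for illustration. Only the matched strings come from the sample messages above.

```python
import re

# Hypothetical signature names; the patterns are taken from the sample
# messages shown in this article.
SIGNATURES = {
    "fan_fru_fault": re.compile(r"callhome\.c\.fan\.fru\.fault"),
    "fan_warning": re.compile(r"monitor\.fan\.warning"),
    "temp_unreadable": re.compile(r"monitor\.temp\.unreadable"),
    "sp_load_high": re.compile(r"SP load is high"),
    "ses_open_error": re.compile(
        r"env_mgr\[\d+\]:.*error opening (?:device 'ses'|ses)", re.IGNORECASE
    ),
}


def count_signatures(lines):
    """Count how many lines match each known signature."""
    counts = {name: 0 for name in SIGNATURES}
    for line in lines:
        for name, pattern in SIGNATURES.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


def matches_issue(counts):
    """Assumed rule: a fan EMS error, high SP load, and a ses open error all appear."""
    fan_error = counts["fan_fru_fault"] > 0 or counts["fan_warning"] > 0
    return fan_error and counts["sp_load_high"] > 0 and counts["ses_open_error"] > 0


if __name__ == "__main__":
    sample = [
        "[Cluster: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: Multiple fans have failed",
        "[SP.notice]: SP load is high: 2.62 2.37 2.04",
        "env_mgr[1356]: Error opening ses, err -1",
    ]
    result = count_signatures(sample)
    print(result)
    print("Matches this article:", matches_issue(result))
```

A match only indicates that the log text resembles the examples in this article. It does not confirm the root cause.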