NetApp FAS controller shuts down with multiple chassis fan failure warnings, but only one chassis fan FRU shows as failed
Applies to
- AFF
- FAS
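Issue
- The controller shuts down because of multiple fan failures, even though only one field-replaceable unit (FRU) needs to be replaced.
- AutoSupport messages similar to the following may be generated: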
HA Group Notification (MULTIPLE CHASSIS FAN FAILED: System will shut down in 2 minutes) ERROR
HA Group Notification (Health Monitor process nphm: NphmCriticalFanFruFaultAlert[xxxxxxxxxxxx]) CRITICAL
HA Group Notification (CHASSIS FAN FRU FAILED: SysFan1 F1) ERROR
HA Group Notification (CHASSIS FAN FRU FAILED: SysFan1 F2) ERROR
- EMS errors for a chassis system fan failure:
Jun 05 00:35:42 [cluster-n01:monitor.chassisFanFail.xMinShutdown:EMERGENCY]: Multiple Chassis Fan failure: System will shut down in 2 minutes.
Jun 05 00:36:00 [cluster-n01:monitor.globalStatus.critical:EMERGENCY]: Multiple fans has failed: SysFan1 F2, SysFan1 F1.
Jun 05 00:38:07 [cluster-n01:monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Multiple fans failed)
- EMS errors for a chassis IO fan failure:
Feb 27 14:06:51 [cluster-n01:monitor.chassisFanFail.xMinShutdown:EMERGENCY]: Multiple Chassis Fan failure: System will shut down in 2 minutes.
Feb 27 14:07:00 [cluster-n01:monitor.globalStatus.critical:EMERGENCY]: Multiple fans has failed: IOfan3 F1, IOfan3 F2.
Feb 27 14:09:21 [cluster-n01:monitor.shutdown.emergency:EMERGENCY]: Emergency shutdown: Environmental Reason Shutdown (Multiple IO fans failed)
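The EMS messages above name two failed fans, but both share the same prefix (SysFan1 or IOfan3). A minimal Python sketch of grouping the reported fans by that prefix follows. The function name `failed_fans_by_module` is hypothetical, and treating the first token of each entry as the fan module (FRU) name is an assumption based only on the sample messages in this article.

```python
import re
from collections import defaultdict

# Matches the fan list in a monitor.globalStatus.critical EMS message.
# The message wording is copied from the samples in this article.
FAN_LIST = re.compile(r"Multiple fans has failed: (?P<fans>.+?)\.?\s*$")


def failed_fans_by_module(ems_lines):
    """Map each fan module name to the sorted list of its failed fan elements."""
    modules = defaultdict(set)
    for line in ems_lines:
        match = FAN_LIST.search(line)
        if not match:
            continue
        for entry in match.group("fans").split(","):
            parts = entry.split()
            # Assumption: each entry looks like "<module> <element>", e.g. "SysFan1 F2".
            if len(parts) == 2:
                module, element = parts
                modules[module].add(element)
    return {module: sorted(elements) for module, elements in modules.items()}


ems = [
    "Jun 05 00:35:42 [cluster-n01:monitor.chassisFanFail.xMinShutdown:EMERGENCY]: "
    "Multiple Chassis Fan failure: System will shut down in 2 minutes.",
    "Jun 05 00:36:00 [cluster-n01:monitor.globalStatus.critical:EMERGENCY]: "
    "Multiple fans has failed: SysFan1 F2, SysFan1 F1.",
]
result = failed_fans_by_module(ems)
print(result)  # {'SysFan1': ['F1', 'F2']}
```

A result with a single key and two elements matches the situation described here: two fan failures are counted, yet they point to one module.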
- SP/BMC system sensors show a status of na for Fan1_1 and Fan1_2:
Fan1_1 | na | RPM | na | na | 600.000 | 900.000 | na | na | na
Fan2_1 | 12600.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan3_1 | 12600.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan4_1 | 12600.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan1_2 | na | RPM | na | na | 600.000 | 900.000 | na | na | na
Fan2_2 | 12700.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan3_2 | 12700.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan4_2 | 12700.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
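To pick out the affected sensors from saved sensor output, a short Python sketch such as the following could be used. The function name `fans_without_reading` is hypothetical, and the column layout (name, reading, unit, status, then thresholds) is an assumption inferred from the sample rows above, not a documented format.

```python
def fans_without_reading(sensor_text):
    """Return names of fan sensors whose reading or status column is 'na'."""
    missing = []
    for line in sensor_text.splitlines():
        cols = [col.strip() for col in line.split("|")]
        # Skip blank lines and any sensor that is not a fan.
        if len(cols) < 4 or not cols[0].lower().startswith("fan"):
            continue
        # Assumed column order: name | reading | unit | status | thresholds...
        name, reading, _unit, status = cols[:4]
        if reading == "na" or status == "na":
            missing.append(name)
    return missing


sample = """\
Fan1_1 | na | RPM | na | na | 600.000 | 900.000 | na | na | na
Fan2_1 | 12600.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
Fan1_2 | na | RPM | na | na | 600.000 | 900.000 | na | na | na
Fan2_2 | 12700.000 | RPM | ok | na | 600.000 | 900.000 | na | na | na
"""
print(fans_without_reading(sample))  # ['Fan1_1', 'Fan1_2']
```

Only the reading and status columns are checked, because the threshold columns contain na for healthy fans as well.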