机箱风扇 FRU 故障且机箱配置无效
适用场景
- AFF A150
- ONTAP 9
- 集群模式 HA 配置
问题描述
- 两个节点同时报告“ CHASSIS FAN FRU FAILED: 多个风扇发生故障”:
HA Group Notification from Cluster-01 (CHASSIS FAN FRU FAILED: Multiple fans have failed) ERROR
HA Group Notification from Cluster-02 (CHASSIS FAN FRU FAILED: Multiple fans have failed) ERROR
- EMS错误显示多个传感器不可用:
[Cluster: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: Multiple fans have failed
- 以下尝试未能解决问题,无法清除警报。
- 已将 BIOS 和 BMC 固件升级到最新版本;
- 重启两个节点的 BMC。
- 重新安装主板并尝试更换主板。即使使用了新的主板,每个节点在重新安装后仍无法启动,并出现以下错误。
Node A:
+++++++++++++++++++++++++++
Initializing System Memory ...
Loading Device Drivers ...
Waiting for SP ...
IPMI:Read midplane FRU common header:timeout
SP failure. Resetting SP from primary FW. This can take a few minutes
Waiting for SP ...
SP recovered successfully after a reset from primary FW image
Waiting for SP ...
IPMI:Read midplane FRU common header:failed
Configuring Devices ...
CPU = 2 Processor(s) Detected.
Intel(R) Xeon(R) CPU D-1557 @ 1.50GHz (CPU 0)
CPUID: 0x00050664. Cores per Processor = 12
CPU1 (CPU 1)
CPUID: 0x00000000. Cores per Processor = 0
32768 MB System RAM Installed.
SATA (AHCI) Device: ATP SATAIII mSATA AF120GSTHI-NT5
Boot Loader version 8.1.0
Copyright (C) 2000-2003 Broadcom Corporation.
Portions Copyright (C) 2002-2023 NetApp, Inc. All Rights Reserved.
BIOS POST Failure(s) detected: Failed to get FRU data. Abort AUTOBOOT
BMC Event log:
Record 120: Tue Jun 03 14:38:13.347240 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
Record 121: Tue Jun 03 14:39:26.286753 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
Node B:
+++++++++++++++++++++++++++
Starting program at 0xffffffff80348000
---<<BOOT>>---
NetApp Data ONTAP 9.14.1P1
random: registering fast source Intel Secure Key RNG
***************************************************
This platform is not supported in this release.
The system will now halt
***************************************************
BIOS Version: 11.21
Portions Copyright (C) 2014-2023 NetApp, Inc. All Rights Reserved.
Initializing System Memory ...
Loading Device Drivers ...
Waiting for SP ...
BIOS Version: 11.21
Portions Copyright (C) 2014-2023 NetApp, Inc. All Rights Reserved.
Initializing System Memory ...
Loading Device Drivers ...
Waiting for SP ...
IPMI:Read midplane FRU common header:timeout
SP failure. Resetting SP from primary FW. This can take a few minutes
Waiting for SP ...
SP recovered successfully after a reset from primary FW image
Waiting for SP ...
IPMI:Read midplane FRU common header:failed
Configuring Devices ...
CPU = 2 Processor(s) Detected.
Intel(R) Xeon(R) CPU D-1557 @ 1.50GHz (CPU 0)
CPUID: 0x00050664. Cores per Processor = 12
CPU1 (CPU 1)
CPUID: 0x00000000. Cores per Processor = 0
32768 MB System RAM Installed.
SATA (AHCI) Device: ATP SATAIII mSATA AF120GSTHI-NT5
Boot Loader version 8.1.0
Copyright (C) 2000-2003 Broadcom Corporation.
Portions Copyright (C) 2002-2023 NetApp, Inc. All Rights Reserved.
BIOS POST Failure(s) detected: Failed to get FRU data. Abort AUTOBOOT
BMC Event log:
Record 444: Tue Jun 03 14:45:55.696857 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
Record 445: Tue Jun 03 14:47:08.277524 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
4.进行了跨控制器测试,发现了底盘故障。