机箱风扇 FRU 故障且机箱配置无效
适用场景
- AFF A150
- ONTAP 9
- 集群模式 HA 配置
问题描述
- 两个节点同时报告“ CHASSIS FAN FRU FAILED: 多个风扇发生故障”:
HA Group Notification from Cluster-01 (CHASSIS FAN FRU FAILED: Multiple fans have failed) ERROR
 HA Group Notification from Cluster-02 (CHASSIS FAN FRU FAILED: Multiple fans have failed) ERROR
- EMS错误显示多个传感器不可用:
[Cluster: env_mgr: callhome.c.fan.fru.fault:error]: Call home for CHASSIS FAN FRU FAILED: Multiple fans have failed
- 以下尝试未能解决问题,无法清除警报。
- 已将 BIOS 和 BMC 固件升级到最新版本;
- 重启两个节点的 BMC。
- 重新安装主板并尝试更换主板。即使使用了新的主板,每个节点在重新安装后仍无法启动,并出现以下错误。
Node A:
 +++++++++++++++++++++++++++
 Initializing System Memory ...
 Loading Device Drivers ...
 Waiting for SP ...
 IPMI:Read midplane FRU common header:timeout
 SP failure. Resetting SP from primary FW. This can take a few minutes
 Waiting for SP ...
 SP recovered successfully after a reset from primary FW image
 Waiting for SP ...
 IPMI:Read midplane FRU common header:failed
 Configuring Devices ...
CPU = 2 Processor(s) Detected.
   Intel(R) Xeon(R) CPU D-1557 @ 1.50GHz (CPU 0)
   CPUID: 0x00050664. Cores per Processor = 12
   CPU1 (CPU 1)
   CPUID: 0x00000000. Cores per Processor = 0
 32768 MB System RAM Installed.
 SATA (AHCI) Device: ATP SATAIII mSATA AF120GSTHI-NT5
Boot Loader version 8.1.0 
 Copyright (C) 2000-2003 Broadcom Corporation.
 Portions Copyright (C) 2002-2023 NetApp, Inc. All Rights Reserved.
 BIOS POST Failure(s) detected: Failed to get FRU data. Abort AUTOBOOT
BMC Event log:
 Record 120: Tue Jun 03 14:38:13.347240 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
 Record 121: Tue Jun 03 14:39:26.286753 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
Node B:
 +++++++++++++++++++++++++++
 Starting program at 0xffffffff80348000
 ---<<BOOT>>---
 NetApp Data ONTAP 9.14.1P1
 random: registering fast source Intel Secure Key RNG
 ***************************************************
  This platform is not supported in this release.   
  The system will now halt  
 ***************************************************
BIOS Version: 11.21
 Portions Copyright (C) 2014-2023 NetApp, Inc. All Rights Reserved.
Initializing System Memory ...
 Loading Device Drivers ...
 Waiting for SP ...
 BIOS Version: 11.21
 Portions Copyright (C) 2014-2023 NetApp, Inc. All Rights Reserved.
Initializing System Memory ...
 Loading Device Drivers ...
 Waiting for SP ...
 IPMI:Read midplane FRU common header:timeout
 SP failure. Resetting SP from primary FW. This can take a few minutes
 Waiting for SP ...
 SP recovered successfully after a reset from primary FW image
 Waiting for SP ...
 IPMI:Read midplane FRU common header:failed
Configuring Devices ...
 CPU = 2 Processor(s) Detected.
   Intel(R) Xeon(R) CPU D-1557 @ 1.50GHz (CPU 0)
   CPUID: 0x00050664. Cores per Processor = 12
   CPU1 (CPU 1)
   CPUID: 0x00000000. Cores per Processor = 0
 32768 MB System RAM Installed.
 SATA (AHCI) Device: ATP SATAIII mSATA AF120GSTHI-NT5
Boot Loader version 8.1.0 
 Copyright (C) 2000-2003 Broadcom Corporation.
 Portions Copyright (C) 2002-2023 NetApp, Inc. All Rights Reserved.
 BIOS POST Failure(s) detected: Failed to get FRU data. Abort AUTOBOOT
BMC Event log:
 Record 444: Tue Jun 03 14:45:55.696857 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
 Record 445: Tue Jun 03 14:47:08.277524 2025 [ASUP.notice]: First notification email | (INVALID CHASSIS CONFIGURATION (Incompatible Partner PCM)) CRITICAL | Send failed
  
4.进行了跨控制器测试,发现了底盘故障。