Battery temperature low alert triggered outside of a learning cycle
Applies to
- FAS2650
- ONTAP 9.7P22
Issue
- The following ASUP (AutoSupport) alert is triggered because the battery temperature is too low:
HA Group Notification from NAP-CL7-02 (BATTERY (temperature low)) ALERT
- system sensors shows the battery temperature as critlow (critical low) at -17°C, and some sensors report not_available:
Bat Volt normal 8100 mV 5500 mV 5600 mV 8500 mV 8600 mV
Bat Pct Capacity normal 64 % 39 % 44 % -- --
Bat Curr not_available -- mA -- -- 1200 mA 1520 mA
Bat Rem Cap normal 768 mA*hr -- -- -- --
Bat Full Cap normal 1024 mA*hr -- -- -- --
Bat Charge Curr not_available -- mA -- -- 2200 mA 2300 mA
Bat Charging State failed --
Bat Charge Volt normal 8200 mV -- -- 8900 mV 9000 mV
Bat Initial FCC normal 1600 mA*hr -- -- -- --
Bat Dstg Cycles normal 13 cycles 2 cycles 5 cycles -- --
Bat Power Fault GOOD
Bat Dcharge FET ON
Bat Charge FET ON
Bat Cycle Count normal 35 cycles -- -- -- --
Bat Learning Active not_available --
Bat Pack Invalid INVALID
Battery Charge failed --
Bat_Temp -17.000 degrees C cr 0.000 5.000 60.000 75.000
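The Bat_Temp line above can be checked against its own thresholds. The following is a minimal sketch, assuming the column order is reading, unit, status flag, then lower critical, lower non-critical, upper non-critical, and upper critical thresholds. That order is inferred from the other sensor rows and the event names in this article, not from a documented format, and the function name classify_bat_temp is hypothetical.

```python
import re

# Bat_Temp line copied from the system sensors output.
LINE = "Bat_Temp -17.000 degrees C cr 0.000 5.000 60.000 75.000"


def classify_bat_temp(line):
    """Return (reading, state) for a Bat_Temp line.

    Assumed layout (not confirmed by the source): reading, then
    lower critical, lower non-critical, upper non-critical,
    upper critical thresholds, all in degrees C.
    """
    values = [float(v) for v in re.findall(r"-?\d+\.\d+", line)]
    if len(values) != 5:
        raise ValueError("expected one reading and four thresholds")
    reading, crit_low, warn_low, warn_high, crit_high = values
    if reading <= crit_low:
        state = "critical low"
    elif reading <= warn_low:
        state = "warning low"
    elif reading >= crit_high:
        state = "critical high"
    elif reading >= warn_high:
        state = "warning high"
    else:
        state = "normal"
    return reading, state


reading, state = classify_bat_temp(LINE)
print(f"{reading:.0f} C -> {state}")  # -17 C -> critical low
```

Under these assumptions, a reading of -17 C is below both low thresholds (5 C and 0 C), which matches the "Lower Non-critical going low" and "Lower Critical going low" events being asserted together.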
- Fault log from hsamcmd --fault-show-all:
tag origin fld fault reason count time
---- ------- ---- ------------- ------ -----
1 0x5 /chassis-1/controller-b/nvbatt-1 Temperature Sensor 92 LNC 1 Mon Apr 21 05:36:24 2025
2 0x5 /chassis-1/controller-b/nvbatt-1 Temperature Sensor 92 LCR 1 Mon Apr 21 05:36:24 2025
3 0x5 /chassis-1/controller-b/nvbatt-1 Environmental Condition by env_mgr 1 Mon Apr 21 05:36:24 2025
4 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 105 1 Mon Apr 21 05:36:30 2025
5 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 106 1 Mon Apr 21 05:36:30 2025
6 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 233 1 Mon Apr 21 05:36:30 2025
7 0x5 /chassis-1 SAS Expander has set the Chassis LED ON 1 Mon Apr 21 05:36:30 2025
8 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 93 1 Mon Apr 21 05:36:37 2025
9 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 94 1 Mon Apr 21 05:36:37 2025
10 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 97 1 Mon Apr 21 05:36:37 2025
11 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 98 1 Mon Apr 21 05:36:37 2025
12 0x5 /chassis-1/controller-b/nvbatt-1 Failed reading sensor: 101 1 Mon Apr 21 05:36:38 2025
- Event log:
Record 2631: Mon Apr 21 05:35:46 2025 [IPMI.notice]: 5502 | 02 | EVT: 0150ef05 | Bat_Temp | Assertion Event, "Lower Non-critical going low "
Record 2632: Mon Apr 21 05:35:46 2025 [IPMI.notice]: 5602 | 02 | EVT: 0152ef00 | Bat_Temp | Assertion Event, "Lower Critical going low "
Record 2633: Mon Apr 21 05:35:46 2025 [IPMI.notice]: 5702 | 02 | EVT: 0301ffff | Bat_Pack_Invalid | Assertion Event, "State Asserted"
Record 2634: Mon Apr 21 05:35:46 2025 [IPMI.notice]: 5802 | 02 | EVT: 0301ffff | Attn_Sensor1 | Assertion Event, "State Asserted"
Record 2635: Mon Apr 21 05:36:00 2025 [IPMI.notice]: 5902 | 02 | EVT: 0300ffff | Attn_Sensor1 | Assertion Event, "State Deasserted"
Record 2636: Mon Apr 21 05:36:18 2025 [IPMI.notice]: 5a02 | 02 | EVT: 81522e00 | Bat_Temp | Deassertion Event, "Lower Critical going low "
Record 2637: Mon Apr 21 05:36:18 2025 [IPMI.notice]: 5b02 | 02 | EVT: 81502e05 | Bat_Temp | Deassertion Event, "Lower Non-critical going low "
Record 2638: Mon Apr 21 05:36:19 2025 [IPMI.notice]: 5c02 | 02 | EVT: 0300ffff | Bat_Pack_Invalid | Assertion Event, "State Deasserted"
Record 2639: Mon Apr 21 05:36:19 2025 [IPMI.notice]: 5d02 | 02 | EVT: 0301ffff | Attn_Sensor1 | Assertion Event, "State Asserted"
Record 2640: Mon Apr 21 05:36:24 2025 [IPMI.notice]: 5e02 | 02 | EVT: 0150ef05 | Bat_Temp | Assertion Event, "Lower Non-critical going low "
Record 2641: Mon Apr 21 05:36:24 2025 [IPMI.notice]: 5f02 | 02 | EVT: 0152ef00 | Bat_Temp | Assertion Event, "Lower Critical going low "
Record 2642: Mon Apr 21 05:36:24 2025 [IPMI.notice]: 6002 | 02 | EVT: 0301ffff | Bat_Pack_Invalid | Assertion Event, "State Asserted"
- EMS log:
Mon Apr 21 14:35:57 +0900 [NAP-CL7-02: env_mgr: nvmem.battery.tempLow:alert]: The NVMEM battery is too cold (-17 C). To prevent data loss, the system will shut down in 24 hours.
Mon Apr 21 14:36:25 +0900 [NAP-CL7-02: env_mgr: nvmem.battery.temp.normal:info]: The NVMEM battery temperature is normal.
Mon Apr 21 14:36:25 +0900 [NAP-CL7-02: env_mgr: nvmem.battery.packValid:notice]: A valid NVMEM battery pack is present.
Mon Apr 21 14:36:47 +0900 [NAP-CL7-02: env_mgr: callhome.battery.warning:alert]: Call home for BATTERY (temperature low) WARNING.
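The event log timestamps carry no time zone, while the EMS timestamps are in +0900. The sketch below lines the two up under the assumption that the event log is recorded in UTC. The source does not state this; it is inferred from the nine-hour offset between matching entries. The helper names parse_sel and parse_ems are hypothetical.

```python
from datetime import datetime, timezone

SEL_FORMAT = "%a %b %d %H:%M:%S %Y"
EMS_FORMAT = "%a %b %d %H:%M:%S %z %Y"


def parse_sel(stamp):
    """Parse an event log timestamp; UTC is an assumption."""
    return datetime.strptime(stamp, SEL_FORMAT).replace(tzinfo=timezone.utc)


def parse_ems(stamp, year):
    """Parse an EMS timestamp; EMS lines omit the year, so it is supplied."""
    return datetime.strptime(f"{stamp} {year}", EMS_FORMAT)


# Timestamps copied from the logs above.
sel_assert = parse_sel("Mon Apr 21 05:35:46 2025")    # Record 2632
sel_deassert = parse_sel("Mon Apr 21 05:36:18 2025")  # Record 2636
ems_temp_low = parse_ems("Mon Apr 21 14:35:57 +0900", 2025)
ems_temp_normal = parse_ems("Mon Apr 21 14:36:25 +0900", 2025)

low_lag = (ems_temp_low - sel_assert).total_seconds()
normal_lag = (ems_temp_normal - sel_deassert).total_seconds()
asserted_for = (sel_deassert - sel_assert).total_seconds()

print(low_lag)       # 11.0
print(normal_lag)    # 7.0
print(asserted_for)  # 32.0
```

If the UTC assumption holds, each EMS message follows its matching event log record by a few seconds, and the first low-temperature assertion lasted about 32 seconds before it was deasserted.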
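- A similar KB article exists, but in this case the error occurs outside of the battery learning cycle: EMS reports NVMEM battery too cold 17 C message during battery learning cycle
- The SP-LATEST-IPMI report shows StateOfHealth at 43%: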
DeviceChemistry : LION
ChemistryID : 0x0286
DeviceName : bq40z50-R1
ID : 27100045
Manufacturer : Nexergy
FW_Version : A2
Manufacturer-date : 3/13/2018
GG_HW_RV : 0x000a
GG_FW_RV : 0x0045 0x0601 0x2400 0x00 0x8503
StateOfHealth : 43 %
BatteryMode : 0x0001
BatteryStatus : 0x00c0
LrnCycleStartDate : 25-04-10-12:00
LrnCycleStartFCC : 0x0559
LrnCycleEndFCC : 0x0545
LrnCycleEndDate : 25-04-10-17:25
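To confirm that the alert fell outside the learning cycle, the LrnCycle fields above can be compared with the time of the first Bat_Temp assertion in the event log. This is a minimal sketch, assuming the LrnCycle dates use a YY-MM-DD-HH:MM layout and that the FCC fields are plain hexadecimal integers; neither is documented in this article, and the units of the FCC values are not assumed. The helper name parse_lrn_date is hypothetical.

```python
from datetime import datetime


def parse_lrn_date(text):
    """Parse a LrnCycle date; the YY-MM-DD-HH:MM layout is an assumption."""
    return datetime.strptime(text, "%y-%m-%d-%H:%M")


# Values copied from the SP-LATEST-IPMI report and the event log.
cycle_start = parse_lrn_date("25-04-10-12:00")
cycle_end = parse_lrn_date("25-04-10-17:25")
alert_time = datetime.strptime("Mon Apr 21 05:35:46 2025", "%a %b %d %H:%M:%S %Y")

# Time zones are ignored here; the gap is large enough that any
# offset of a few hours does not change the result.
in_cycle = cycle_start <= alert_time <= cycle_end
days_after_cycle = (alert_time - cycle_end).days

start_fcc = int("0x0559", 16)
end_fcc = int("0x0545", 16)

print(in_cycle)             # False
print(days_after_cycle)     # 10
print(start_fcc, end_fcc)   # 1369 1349
```

Under these assumptions, the last learning cycle ended more than 10 days before the alert, which is consistent with the alert being raised outside of a learning cycle.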