CHW-1885:AFF A320:"需要更换机箱电源"更换 PSU 后 AutoSupport
问题描述
- 在 AFF A320 系统上,机箱中的两个节点在 PSU 已更换后继续报告电源单元 (PSU) 错误。
- 以下是此问题的示例事件:
env_mgr: monitor.chassisPowerSupply.degraded:debug]:机箱电源X降级:PSU2 Volt Out is Critical High (YYYY mV)
monitor.chassisPowerSupply.degraded:debug]:机箱电源X降级:PSUX Curr Out is Critical Low (0 mA)
env_mgr: monitor.chassisPowerSupply.degraded:debug]:机箱电源X降级:PSU2 Power Out is Critical Low (0 W)
env_mgr: callhome.chassis.ps.replace:debug]:需要更换机箱电源:PS X
- 以下是显示 PSU 传感器问题的 "system node environment sensors show" 输出的示例:
...
PSUX fru fault normal MULTIFAULT
PSUX FRU Switch discrete normal normal ON
PSUX FRU Power discrete fault normal LOW
PSUX FRU Current discrete fault normal LOW
PSUX FRU Voltage discrete fault normal HIGH
- 以下是显示 PSU 传感器问题(cr = critical)的 BMC "system sensors show" 命令或 AutoSupport SP-LATEST-IPMI.txt
文件示例:
...
PSU2_AC_VIN | 204.000 | Volts | ok | 180.000 | 184.000 | 260.000 | 264.000
PSU2_AC_Curr_IIN | 0.156 | Amps | ok | 0.000 | 0.000 | 9.984 | 12.012
PSU2_FAN | 4500.000 | RPM | ok | 800.000 | 1200.000 | na | na
PSU2_Inlet_Temp | 23.000 | degrees C | ok | 0.000 | 5.000 | 50.000 | 60.000
PSU2_Hot_Temp | 33.000 | degrees C | ok | 0.000 | 0.000 | 100.000 | 105.000
PSU2_PIN | 0.000 | Watts | cr | 7.100 | 14.200 | 1611.700 | 1796.300
PSU2_Presence | 0x0 | discrete | Present | na | na | na | na
PSU2_FB_Hot_Temp | 27.000 | degrees C | ok | 0.000 | 0.000 | 100.000 | 105.000
PSU2_VOUT | 12.599 | Volts | cr | 11.400 | 11.400 | 12.589 | 12.599
PSU2_IOUT | 0.000 | Amps | cr | 0.000 | 0.000 | 140.000 | 149.000
PSU2_POUT | 0.000 | Watts | cr | 7.100 | 14.200 | 1611.700 | 1796.300
Power_Good | 0x0 | discrete | Asserted | na | na | na | na