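Only one node reports elevated temperature and two failed power supplies
Applies to
Issue
- Only one AFF A250 node reports that the chassis temperature is too high and that two power supplies have failed. Example: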
::> event log show -event monitor*,chassis*
Severity Event
--------- ----------------------------------------------
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 In Fault is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Fan is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 FB Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Inlet is Unreadable
ERROR callhome.chassis.hitemp: Call home for CHASSIS OVER TEMPERATURE
EMERGENCY monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1. Chassis temperature is too high..
ERROR monitor.temp.unreadable: The controller temperature (PSU2 FB Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU2 Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU2 Inlet) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU1 FB Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU1 Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU1 Inlet) is not readable.
ERROR callhome.chassis.ps.degraded: Call home for CHASSIS POWER SUPPLY DEGRADED: PS 1
EMERGENCY monitor.globalStatus.critical: Power Supply Status Critical: PSU2, PSU1.
ERROR callhome.chassis.power: Call home for CHASSIS POWER DEGRADED: Power Supply Status Critical: PSU2, PSU1.
ALERT monitor.chassisPower.degraded: Chassis power is degraded: Power Supply Status Critical: PSU2,PSU1.
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 2 is degraded: PSU2 Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 2 is degraded: PSU2 Inlet is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Out Fault is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Warning is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 In Fault is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Fan is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 FB Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Hot is Unreadable
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Inlet is Unreadable
- The partner node reports all sensors as normal:
::> system controller environment show
Node FRU Name State
------------------ ------------------------------ -----------
netappv17-01 PSU2 GOOD
netappv17-01 PSU1 GOOD
netappv17-02 PSU2 unknown
netappv17-02 PSU1 unknown
- One embedded NSM8E module reports:
[dsa_worker3: ses.status.enclWarn:error]: NS224NSM8E (S/N SHJHU0123456789) shelf 0 on channel 0s disk enclosure warning for Enclosure 1: VPD EEPROMs mismatch or unreadable. This element is on the unknown location.
[dsa_worker3: ses.status.ModuleWarn:alert]: NS224NSM8E (S/N SHJHU0123456789) shelf 0 on channel 0s PCI switch warning for PCI Switch 2: non-critical status; Backplane VPD SEEROM corrupt or unreadable. This element is on the rear of the shelf at the bottom, on shelf module (B).
The storage show fault output shows the same fault:
Enclosure Status: non-critical
Channel: 0s
Shelf: 0
Shelf Type: NS224NSM8E
Module Type: NSM8E
Enclosure:
Element Status Status Bytes Status Descriptions
1: NONCRITICAL 03,00,00,00
PSM:
Element Status Status Bytes Status Descriptions
1 [NSM8E A] : NONCRITICAL 03,0C,00,00 MIDPLANE VPD FAULT, MASTER
2 [NSM8E B] : NONCRITICAL 03,04,00,00 MIDPLANE VPD FAULT
- The issue persists after the PSUs and the node are reseated.
- The issue persists with known-good PSUs.
- When the controllers are swapped between two chassis, the issue stays with one chassis slot.
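
The symptom pattern in this section (one node cannot read the PSU sensors while its partner reports the same PSUs as GOOD) can be checked from saved CLI output. The following is a minimal Python sketch, not a NetApp tool. The function names count_events and non_good_frus are hypothetical, and the parsing assumes the simplified column layout of the excerpts shown in this article; real ONTAP output can contain additional columns.

```python
import re
from collections import Counter

# Assumed event line shape: "<SEVERITY> <event.name>: <message>"
EVENT_RE = re.compile(r"^([A-Z]+)\s+([\w.]+):\s")


def count_events(text):
    """Count events per (severity, event name); other lines are skipped."""
    counts = Counter()
    for line in text.splitlines():
        match = EVENT_RE.match(line.strip())
        if match:
            counts[match.groups()] += 1
    return counts


def non_good_frus(text):
    """Map node name -> list of FRUs whose state is not GOOD.

    Assumes the three-column excerpt (node, FRU, state) with no spaces
    inside a field; the header and separator rows are skipped.
    """
    result = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3 or set(parts[0]) == {"-"}:
            continue
        node, fru, state = parts
        if state.upper() != "GOOD":
            result.setdefault(node, []).append(fru)
    return result


# Lines copied from the excerpts in this article.
EVENT_SAMPLE = """\
Severity Event
--------- ----------------------------------------------
NOTICE monitor.chassisPowerSupply.degraded: Chassis power supply 1 is degraded: PSU1 Fan is Unreadable
ERROR callhome.chassis.hitemp: Call home for CHASSIS OVER TEMPERATURE
ERROR monitor.temp.unreadable: The controller temperature (PSU2 Hot) is not readable.
ERROR monitor.temp.unreadable: The controller temperature (PSU1 Hot) is not readable.
"""

ENV_SAMPLE = """\
Node FRU Name State
------------------ ------------------------------ -----------
netappv17-01 PSU2 GOOD
netappv17-01 PSU1 GOOD
netappv17-02 PSU2 unknown
netappv17-02 PSU1 unknown
"""

if __name__ == "__main__":
    for (severity, event), n in sorted(count_events(EVENT_SAMPLE).items()):
        print(f"{severity:<10} {event:<40} {n}")
    print(non_good_frus(ENV_SAMPLE))
```

If non_good_frus returns entries for only one node, and the event summary for that node is dominated by "unreadable" sensor events rather than actual over-temperature readings, the output is consistent with the symptoms described here.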