PSU报告严重状态并发出磁盘架电源中断警报
适用场景
- FAS/AFA系统
- 磁盘架
- 电源设备(PSU)
问题描述
- 事件日志中会报告以下警报:
[Node-A: statd: monitor.shelf.fault:alert]: Critical fault reported on disk storage shelf attached to channel 0a. Check fans, power supplies, disks, and temperature sensors.
[Node-A: statd: callhome.shlf.fault:error]: Call home for SHELF_FAULT
[Node-A: dsa_worker4: ses.status.psError:alert]: DS212-12 (S/N SHFNCXXXXXXXXXX) shelf 0 on channel 0b power error for Power supply 2: critical status; AC Fail. This module is on the rear of the shelf at the bottom right.
[Node-A: dsa_worker4: callhome.shlf.power.intr:error]: Call home for SHELF POWER INTERRUPTED
- 存储架上出现以下错误:
Cluster1::> storage shelf show -errors
Shelf Name: 1.0
Shelf UID: 50:0a:09:XX:XX:XX:XX:XX
Serial Number: SHFNCXXXXXXXXXX
Error Type Description
------------------ ---------------------------
Power Critical condition is detected in storage shelf power supply unit "2". The unit might fail.
Voltage Critical error detected in voltage sensor "4". Sensor is located in the rear of the shelf on the lower right power supply.
Voltage Critical error detected in voltage sensor "3". Sensor is located in the rear of the shelf on the lower right power supply.
- 电源状态在
storage show fault
输出中报告为故障:
Cluster1::> system node run -node <node> -command storage show fault
Enclosure Status: unrecoverable
Channel: 0a
Shelf: 0
Shelf Type: DS224-12
Product Serial Number: SHFNCXXXXXXXXXX
Module Type: IOM12
Power Supplies:
Element Status Status Bytes Status Descriptions
1: OK 01,00,00,20 RQSTED ON
2: CRITICAL 02,00,00,F3 DC FAIL, AC FAIL, OFF, RQSTED ON, FAIL
Enclosure:
Element Status Status Bytes Status Descriptions
1: OK 01,00,02,00 FAIL
- 环境状态报告:
Cluster1::>system node run -node <nodename> -command environment status
Channel: 2a
Shelf: 11
SES device path: local access: 2a.11.99
Module type: IOM6; monitoring is active
Shelf status: unrecoverable condition
SES Configuration, shelf 11:
logical identifier=
vendor identification=NETAPP
product identification=DS4246
product revision level=0211
Vendor-specific information:
Product Serial Number: SHJSG1234000011
Status reads attempted: 9955419; failed: 0
Control writes attempted: 0; failed: 0
Shelf bays with disk devices installed:
23, 22, 21, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0
with error: none
Power Supply installed element list: 1, 2, 3, 4; with error: 2
Power Supply information by element:
[1] Serial number: PMW87654321D3DE Part number: 0082562-22
Type: 9C
Input voltage: <N/A>
Firmware version: 0311 Swaps: 0
[2] Serial number: PMW87654321D801 Part number: 0082562-22
Type: 9C
Input voltage: <N/A>
Firmware version: 0311 Swaps: 0
[3] Serial number: PMW87654321D757 Part number: 0082562-22
Type: 9C
Input voltage: <N/A>
Firmware version: 0311 Swaps: 0
[4] Serial number: PMW87654321C2DD Part number: 0082562-22
Type: 9C
Input voltage: <N/A>
Firmware version: 0311 Swaps: 0
Voltage Sensor installed element list: 1, 2, 3, 4, 5, 6, 7, 8; with error: 4
Shelf voltages by element:
[1] 5.01 Volts Normal voltage range
[2] 12.03 Volts Normal voltage range
[3] 3.02 Volts Normal voltage range
[4] <N/A> sensor condition <N/A>
[5] 5.01 Volts Normal voltage range
[6] 12.03 Volts Normal voltage range
[7] 5.01 Volts Normal voltage range
[8] 12.03 Volts Normal voltage range