SwitchPowerFail_Alert is reported by the system
Applies to
- ONTAP 9
- Cisco Nexus cluster network switches purchased from NetApp
Issue
- Switch power supply alerts are reported on multiple cluster network switches at the same time.
::> system health alert show
Node: node_name-1
Resource: module-1 BACK
Severity: Major
Indication Time: Thu Apr 14 19:25:23 2022
Suppress: false
Acknowledge: false
Probable Cause: Sensor "module-1 BACK" on switch
"switch_name-1(ABC12234DE5)" is reporting a
temperature of 4294967168 C, which is above the
critical threshold.
Possible Effect: Switch "switch_name-1(ABC12234DE5)" might shut down
if the switch temperature remains above the critical
threshold.
Corrective Actions: 1) Verify that the fans on the switch are working properly.
2) Maintain the switch's recommended operating environment.
3) Verify that the front and rear panels of the switch
are clear of any obstructions.
Node: node_name-1
Resource: module-1 BACK
Severity: Major
Indication Time: Thu Apr 14 19:25:23 2022
Suppress: false
Acknowledge: false
Probable Cause: Sensor "module-1 BACK" on switch
"switch_name-2(ABC12234DE6)" is reporting a
temperature of 4294967168 C, which is above the
critical threshold.
Possible Effect: Switch "switch_name-2(ABC12234DE6)" might shut down
if the switch temperature remains above the critical
threshold.
Corrective Actions: 1) Verify that the fans on the switch are working properly.
2) Maintain the switch's recommended operating environment.
3) Verify that the front and rear panels of the switch
are clear of any obstructions.
2 entries were displayed
- Cluster Switch Health Monitoring (CSHM) reports the following after the health alert:
"PowerSupply" on switch "switch name" is missing or is not operational.
- The following EMS messages are observed:
[netapp-01: cshmd: hm.alert.raised:alert]: Alert Id = SwitchPowerFail_Alert , Alerting Resource = switch name/PowerSupply raised by monitor cluster-switch
[netapp-01: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchPowerFail_Alert[switch name/PowerSupply].
- The switch logs indicate a PSU problem:
# show environment
Fan:
---------------------------------------------------------------------------
Fan Model Hw Direction Status
---------------------------------------------------------------------------
...
Fan_in_PS2 -- -- back-to-front Shutdown
...
Power Supply:
Voltage: 12 Volts
Power                            Actual      Actual      Total
Supply   Model                   Output      Input       Capacity    Status
                                 (Watts )    (Watts )    (Watts )
------- ---------- --------------- ------ ---------- --------------------
1        NXA-PAC-1100W-PE2       276 W       312 W       1100 W      Ok
2        NXA-PAC-1100W-PE2       0 W         0 W         0 W         Shutdown
Power Usage Summary:
--------------------
...
Total Grid-A (first half of PS slots) Power Capacity 1100.00 W
Total Grid-B (second half of PS slots) Power Capacity 0.00 W
...
Temperature:
--------------------------------------------------------------------
Module   Sensor     MajorThresh   MinorThres   CurTemp     Status
                    (Celsius)     (Celsius)    (Celsius)
--------------------------------------------------------------------
1        FRONT      80            70           48          Ok
1        BACK       70            42           n/a         Failure
1        CPU        90            80           57          Ok
1        Heavenly   110           90           68          Ok
# show environment fan detail
Fan:
---------------------------------------------------------------------------
Fan Model Hw Direction Status
---------------------------------------------------------------------------
...
Fan_in_PS1 -- -- back-to-front Ok
Fan_in_PS2 -- -- back-to-front Shutdown
...
# show hardware
...
---------------------------------------
Chassis has 2 PowerSupply Slots
---------------------------------------
...
PS2 fail/shutdown
Power supply type is: 1100.00W 220v AC
Model number is NXA-PAC-1100W-PE2
H/W version is 160
Part Number is 123-4567-89
Part Revision is A0
Manufacture Date is Year 2021 Week 27
Serial number is ABC1234D5E6
CLEI code is ABCDEFGHIJ
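The symptoms above can be checked with a short script. This is a minimal Python sketch, not a NetApp or Cisco tool: the function names `failed_power_supplies` and `as_signed_32` are hypothetical, and the row pattern assumes the Power Supply table layout shown in the `show environment` output above. The second helper illustrates an arithmetic observation only: 4294967168 equals 2^32 - 128, so the reported temperature is consistent with a value of -128 displayed as an unsigned 32-bit number. The source does not confirm that this is how the value is produced.

```python
import re

# Assumed row layout: slot, model, output W, input W, capacity W, status.
PSU_ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+(\d+) W\s+(\d+) W\s+(\d+) W\s+(\S+)\s*$"
)


def failed_power_supplies(show_environment_output):
    """Return (slot, model, status) for each PSU row whose status is not Ok."""
    failed = []
    for line in show_environment_output.splitlines():
        match = PSU_ROW.match(line)
        if match and match.group(6).lower() != "ok":
            failed.append((int(match.group(1)), match.group(2), match.group(6)))
    return failed


def as_signed_32(value):
    """Reinterpret an unsigned 32-bit integer as a signed 32-bit integer."""
    return value - 2**32 if value >= 2**31 else value


# Rows taken from the Power Supply table in the output above.
SAMPLE = """\
1 NXA-PAC-1100W-PE2 276 W 312 W 1100 W Ok
2 NXA-PAC-1100W-PE2 0 W 0 W 0 W Shutdown
"""

print(failed_power_supplies(SAMPLE))  # [(2, 'NXA-PAC-1100W-PE2', 'Shutdown')]
print(as_signed_32(4294967168))       # -128
```

Running the parser against the captured `show environment` text from each switch gives a quick way to confirm that the same PSU slot is in `Shutdown` state on every switch that raised the alert.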