跳转到主内容

SwitchPowerFail_Alert 警告がシステムによって報告される

Views:
1
Visibility:
Public
Votes:
0
Category:
fabric-interconnect-and-management-switches<a>2009044641</a>
Specialty:
hw
Last Updated:

環境

  • ONTAP 9
  • NetAppから購入したCisco Nexusクラスタネットワークスイッチ

問題

  • 複数の クラスタネットワークスイッチで同時にスイッチの電源装置に関するアラートが報告されます。

::> system health alert show
        Node: node_name-1
      Resource: module-1 BACK
      Severity: Major
   Indication Time: Thu Apr 14 19:25:23 2022
      Suppress: false
     Acknowledge: false
   Probable Cause: Sensor "module-1 BACK" on switch
           "switch_name-1(ABC12234DE5)" is reporting a
           temperature of 4294967168 C, which is above the
           critical threshold.
   Possible Effect: Switch "switch_name-1(ABC12234DE5)" might shut down
           if the switch temperature remains above the critical
           threshold.
Corrective Actions: 1) Verify that the fans on the switch are working properly.
           2) Maintain the switch's recommended operating environment.
           3) Verify that the front and rear panels of the switch
           are clear of any obstructions.
        Node: node_name-1
      Resource: module-1 BACK
      Severity: Major
   Indication Time: Thu Apr 14 19:25:23 2022
      Suppress: false
     Acknowledge: false
   Probable Cause: Sensor "module-1 BACK" on switch
           "switch_name-2(ABC12234DE6)" is reporting a
           temperature of 4294967168 C, which is above the
           critical threshold.
   Possible Effect: Switch "switch_name-2(ABC12234DE6)" might shut down
           if the switch temperature remains above the critical
           threshold.
Corrective Actions: 1) Verify that the fans on the switch are working properly.
           2) Maintain the switch's recommended operating environment.
           3) Verify that the front and rear panels of the switch
           are clear of any obstructions.
2 entries were displayed

  • Cluster Switch Health Monitoring(CSHM)がヘルスアラートのあとにレポートする

"PowerSupply" on switch "switch name" is missing or is not operational.

  • 監視されるEMSメッセージ

[netapp-01: cshmd: hm.alert.raised:alert]: Alert Id = SwitchPowerFail_Alert , Alerting Resource = switch name/PowerSupply raised by monitor cluster-switch
[netapp-01: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchPowerFail_Alert[switch name/PowerSupply].

  • スイッチのログがPSU問題を示している

# show environment 
Fan:
---------------------------------------------------------------------------
Fan        Model         Hw    Direction     Status
---------------------------------------------------------------------------
...
Fan_in_PS2    --           --    back-to-front  Shutdown
...
Power Supply:
Voltage: 12 Volts
Power            Actual        Actual     Total
Supply   Model       Output        Input     Capacity    Status
              (Watts )       (Watts )    (Watts )
-------  ----------  ---------------  ------  ----------  --------------------
1     NXA-PAC-1100W-PE2    276 W        312 W    1100 W    Ok
2     NXA-PAC-1100W-PE2     0 W         0 W      0 W  Shutdown
Power Usage Summary:
--------------------
...
Total Grid-A (first half of PS slots) Power Capacity     1100.00 W
Total Grid-B (second half of PS slots) Power Capacity     0.00 W
...
Temperature:
--------------------------------------------------------------------
Module   Sensor     MajorThresh   MinorThres   CurTemp    Status
            (Celsius)    (Celsius)   (Celsius)      
--------------------------------------------------------------------
1     FRONT       80        70      48      Ok        
1     BACK       70        42      n/a     Failure    
1     CPU        90        80      57      Ok        
1     Heavenly     110        90      68      Ok        

# show environment fan detail
Fan:
---------------------------------------------------------------------------
Fan        Model         Hw    Direction     Status
---------------------------------------------------------------------------
...
Fan_in_PS1    --           --    back-to-front   Ok
Fan_in_PS2    --           --    back-to-front  Shutdown
...

# show hardware
...
---------------------------------------
Chassis has 2 PowerSupply Slots
---------------------------------------
...
PS2 fail/shutdown
  Power supply type is: 1100.00W 220v AC
  Model number is NXA-PAC-1100W-PE2
  H/W version is 160
  Part Number is 123-4567-89
  Part Revision is A0
  Manufacture Date is Year 2021 Week 27
  Serial number is ABC1234D5E6
  CLEI code is ABCDEFGHIJ

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.
Scan to view the article on your device