跳转到主内容

NetApp_Insight_2020.png 

“SwitchFanNotPresent_Alert” reported by CSHM for Cisco cluster network switches

Views:
5
Visibility:
Public
Votes:
0
Category:
fas-systems
Specialty:
hw
Last Updated:

 

Applies to

Cisco cluster network switches

Issue

There are two possible issues in this scenario:

1. The following “SwitchFanNotPresent_Alert” is reported by CSHM for all the fan modules of the cluster network switch:

Wed Apr 29 15:53:23 AEST [nodename: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchFanNotPresent_Alert[switch(XXXXXXXXXXX)/Fan Module-1].
Wed Apr 29 15:28:23 AEST [nodename: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchFanNotPresent_Alert[switch(XXXXXXXXXXX)/Fan Module-2]

2. Despite no issues seen on cluster switches, following AutoSupport alert messages are received:

 Call home for Health Monitor process cshm: SwitchFanNotPresent_Alert

Cause

Cause for Issue 1:

  • This issue is likely to occur when the cluster switches fail to fetch fan related information.
  • The cluster switch health monitor (CSHM) can only detect that a fan or a power supply is missing when the queries to the switch fail to return information about all the fans or power supplies.

Cause for Issue 2:

  • False alerts are reported as a result of an SNMP communication problem between the switch and storage.
  • This happens when the node-management LIF is not hosted on the default management port: e0M.

Solution

  • Check if any maintenance activity was performed during the time the alert was reported by CSHM.
  • Verify if the alerts are cleared after some time, in the event logs:

cluster::> event log show
Time                Node             Severity      Event
------------------- ---------------- ------------- ---------------------------
4/29/2020 15:49:12  nodename         ALERT         hm.alert.raised: Alert Id = SwitchFanNotPresent_Alert , Alerting Resource = switch(XXXXXXXXXXX)/Fan Module-1 raised by monitor cluster-switch
4/29/2020 16:10:20  nodename         NOTICE        hm.alert.cleared: Alert Id = SwitchFanNotPresent_Alert , Alerting Resource = switch(XXXXXXXXXXX)/Fan Module-1 cleared by monitor cluster-switch

  • Login to both the cluster switches and verify if all the fan modules are operating in good condition, by running the following command:

Switch>enable
Switch#show environment fan detail

Fan:
---------------------------------------------------------------------------
Fan             Model                Hw     Direction       Status
---------------------------------------------------------------------------
Fan1(sys_fan1)  NXA-FAN-30CFM-F      0.0    front-to-back   Ok 
Fan2(sys_fan2)  NXA-FAN-30CFM-F      0.0    front-to-back   Ok 
Fan3(sys_fan3)  NXA-FAN-30CFM-F      0.0    front-to-back   Ok 
Fan4(sys_fan4)  NXA-FAN-30CFM-F      0.0    front-to-back   Ok 
Fan_in_PS1      N2200-PAC-400W       --     front-to-back   Ok
Fan_in_PS2      N2200-PAC-400W       --     front-to-back   Ok
Fan Zone Speed: Zone 1: 0x32

  • If this is a one time event and the alerts are cleared on its own, then this can be safely ignored and no further action is required.
  • If the issue persists, contact NetApp Technical Support for further assistance.

For Issue 2:

  • The SNMP traffic needs to be sent to the management port of the cluster network switch.
  • The cluster port cannot reach the management port of the switch, but ONTAP management port e0M can.
  • Move the node-management LIF to port: e0M.

 

CUSTOMER EXCLUSIVE CONTENT

Registered NetApp customers get unlimited access to our dynamic Knowledge Base.

New authoritative content is published and updated each day by our team of experts.

Current Customer or Partner?

Sign In for unlimited access

New to NetApp?

Learn more about our award-winning Support