SwitchFanNotPresent 或 SwitchPowerNotPresent 由 CSHM 针对 Cisco 集群网络交换机报告
适用于
- ONTAP 9
- Cisco 集群网络交换机
问题描述
- CSHM 报告了集群网络交换机的一个或多个风扇模块的 "SwitchFanNotPresent_Alert":
Wed Apr 29 15:53:23 AEST [nodename: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchFanNotPresent_Alert[switch(XXXXXXXXXXX)/Fan Module-1].
Wed Apr 29 15:28:23 AEST [nodename: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchFanNotPresent_Alert[switch(XXXXXXXXXXX)/Fan Module-2]
- CSHM 报告了集群网络交换机的一个或多个 PSU 的 "SwitchPowerNotPresent_Alert":
Wed Apr 29 15:28:23 AEST [nodename: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchPowerNotPresent_Alert[switch(XXXXXXXXXX)/PowerSupply-1].
- 在 CSHM 报告警报期间未执行维护活动。
- 警报会在一段时间后在事件日志中清除:
cluster::> event log show
Time Node Severity Event
------------------- ---------------- ------------- ---------------------------
4/29/2020 15:49:12 nodename ALERT hm.alert.raised: Alert Id = SwitchFanNotPresent_Alert , Alerting Resource = switch(XXXXXXXXXXX)/Fan Module-1 raised by monitor cluster-switch
4/29/2020 16:10:20 nodename NOTICE hm.alert.cleared: Alert Id = SwitchFanNotPresent_Alert , Alerting Resource = switch(XXXXXXXXXXX)/Fan Module-1 cleared by monitor cluster-switch
- 将交换机从监控中移除,然后重新添加以轮询传感器并不能解决此问题。
::> system switch ethernet delete -device <switch_name>
::> system switch ethernet create -device <switch_name> -address <ip_address> -snmp-version <version> -community-or-username cshm1! -model OTHER -type cluster-network
- 风扇模块在两台交换机上都运行良好。
Switch> enable
Switch#show environment fan detail
Fan:
---------------------------------------------------------------------------
Fan Model Hw Direction Status
---------------------------------------------------------------------------
Fan1(sys_fan1) NXA-FAN-30CFM-F 0.0 front-to-back Ok
Fan2(sys_fan2) NXA-FAN-30CFM-F 0.0 front-to-back Ok
Fan3(sys_fan3) NXA-FAN-30CFM-F 0.0 front-to-back Ok
Fan4(sys_fan4) NXA-FAN-30CFM-F 0.0 front-to-back Ok
Fan_in_PS1 N2200-PAC-400W -- front-to-back Ok
Fan_in_PS2 N2200-PAC-400W -- front-to-back Ok
Fan Zone Speed: Zone 1: 0x32
- 从交换机日志("
show tech-support")中,我们可以看到,在报告警报期间,交换机在获取风扇相关信息方面存在问题:
`show system internal platform all`
1)Event:E_DEBUG, length:98, at 034398 usecs after Wed Apr 29 11:07:55 2020
[103] pfm_pss_fan_restore_cfg_info_from_startup(2028):
pss fan config fetch from startup failed
发生原因
- 当集群交换机无法获取风扇相关信息时,可能会发生此问题。
- 集群交换机健康监视器 (CSHM) 只能在对交换机的查询未能返回有关所有风扇或电源的信息时检测到风扇或电源丢失。
解决方案
- 如果这是一次性事件,并且警报已自行清除,则可以安全地忽略此事件,无需执行进一步操作。
- 此问题可能是由于在节点和群集交换机之间运行的管理网络上的拥塞造成的。确保此网络与数据流量隔离。
- 如果此问题仍然存在,