ClusterSwitchConfig_Alert reported in system health alerts for a non-existent switch

Applies to

ONTAP 9
Cluster Switch Health Monitor (CSHM) AutoSupport messages

Issue

The Switch-Health subsystem is reported as degraded:

cluster1::> system health subsystem show
Subsystem           Health
-----------------   ------------------
SAS-connect         ok
Environment         ok
Memory              ok
Service-Processor   ok
Switch-Health       degraded
CIFS-NDO            ok
Motherboard         ok
IO                  ok
MetroCluster        ok
MetroCluster_Node   ok
FHM-Switch          ok
FHM-Bridge          ok
SAS-connect_Cluster ok
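When this output is captured to a file or collected by a script, the degraded subsystems can be picked out automatically. The following is a minimal Python sketch, not a NetApp tool: the function name `degraded_subsystems` and the `SAMPLE` text are illustrative assumptions, and the parser assumes the two-column layout shown above, one row per line.

```python
# Illustrative sketch: find subsystems whose health is not "ok" in saved
# "system health subsystem show" output. Assumes one row per line.

SAMPLE = """\
cluster1::> system health subsystem show
Subsystem         Health
----------------- ------------------
SAS-connect       ok
Environment       ok
Switch-Health     degraded
FHM-Switch        ok
"""


def degraded_subsystems(output):
    """Return the names of subsystems whose Health column is not 'ok'."""
    result = []
    for line in output.splitlines():
        parts = line.split()
        # Data rows have exactly two columns; skip the prompt line,
        # the header row, and the dashed separator row.
        if len(parts) != 2:
            continue
        if parts[0] == "Subsystem" or set(parts[0]) == {"-"}:
            continue
        if parts[1] != "ok":
            result.append(parts[0])
    return result


print(degraded_subsystems(SAMPLE))
```

For the sample text, only Switch-Health is returned, which matches the symptom described in this article.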

Example output of system health alert show:

Node: node_name1
Resource: node_name2
Severity: Major
Indication Time: Fri Aug 13 23:03:53 2021
Suppress: false
Acknowledge: false
Probable Cause: One or more nodes are not connected to both cluster switches.
Possible Effect: If one cluster switch fails, "node_name2" might lose access to the cluster.
Corrective Actions: Ensure the switch "no_switch_name1" is connected to the node "node_name2".

Node: node_name1
Resource: no_switch_name1
Severity: Major
Indication Time: Fri Aug 13 23:29:53 2021
Suppress: false
Acknowledge: false
Probable Cause: Cluster switch "no_switch_name1" with IP address "123.123.123.123" is not reachable via SNMP. Incorrect SNMP community string might be configured on the cluster switch.
Possible Effect: Cluster switch communication problems and accessibility issues.
Corrective Actions: Check the SNMP community string on the cluster switch to verify the expected community string is configured. Use the "system cluster-switch show -snmp-config" command to view the expected community string.

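The switch names and IP addresses in the alerts do not belong to any cluster switch, or the alerts are reported on a switchless cluster.
Example EMS log: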

Fri Aug 13 23:05:50 +0100 [node_name1: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: ClusterSwitchConfig_Alert[node_name2].
Fri Aug 13 23:30:51 +0100 [node_name1: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: SwitchCommunityString_Alert[no_switch_name1].
Fri Aug 13 23:03:54 +0100 [node_name1: cshmd: hm.alert.raised:alert]: Alert Id = ClusterSwitchConfig_Alert , Alerting Resource = node_name2 raised by monitor cluster-switch
Fri Aug 13 23:29:53 +0100 [node_name1: cshmd: hm.alert.raised:alert]: Alert Id = SwitchCommunityString_Alert , Alerting Resource = no_switch_name1 raised by monitor cluster-switch
Fri Aug 13 23:29:53 +0100 [node_name1: cshmd: hm.alert.raised:alert]: Alert Id = SwitchCommunityString_Alert , Alerting Resource = no_switch_name2 raised by monitor cluster-switch

The command network port show -node * -role cluster -fields remote-device-id reports the correct cluster network switches.
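The mismatch described here, alerts that name a device the cluster ports do not actually see, can be checked against saved EMS log text. The sketch below is a hedged illustration in Python, not part of ONTAP: the helper names `parse_raised_alerts` and `unknown_resources` are hypothetical, the regular expression is written only against the hm.alert.raised lines shown in this article, and the `known` set stands in for the node names plus the remote-device-id values that would be collected by hand from the command output.

```python
import re

# Matches only the "hm.alert.raised" EMS lines in the format shown in this article.
ALERT_RE = re.compile(
    r"hm\.alert\.raised:alert\]:\s*Alert Id = (?P<alert>\S+)\s*,"
    r"\s*Alerting Resource = (?P<resource>\S+)\s+raised by monitor (?P<monitor>\S+)"
)

EMS_SAMPLE = [
    "Fri Aug 13 23:05:50 +0100 [node_name1: mgwd: callhome.hm.alert.major:alert]: "
    "Call home for Health Monitor process cshm: ClusterSwitchConfig_Alert[node_name2].",
    "Fri Aug 13 23:03:54 +0100 [node_name1: cshmd: hm.alert.raised:alert]: "
    "Alert Id = ClusterSwitchConfig_Alert , Alerting Resource = node_name2 "
    "raised by monitor cluster-switch",
    "Fri Aug 13 23:29:53 +0100 [node_name1: cshmd: hm.alert.raised:alert]: "
    "Alert Id = SwitchCommunityString_Alert , Alerting Resource = no_switch_name1 "
    "raised by monitor cluster-switch",
    "Fri Aug 13 23:29:53 +0100 [node_name1: cshmd: hm.alert.raised:alert]: "
    "Alert Id = SwitchCommunityString_Alert , Alerting Resource = no_switch_name2 "
    "raised by monitor cluster-switch",
]


def parse_raised_alerts(lines):
    """Return (alert_id, resource, monitor) for each hm.alert.raised line."""
    found = []
    for line in lines:
        match = ALERT_RE.search(line)
        if match:
            found.append(
                (match.group("alert"), match.group("resource"), match.group("monitor"))
            )
    return found


def unknown_resources(alerts, known_names):
    """Return alerting resources not present in known_names, in first-seen order."""
    unknown = []
    for _alert, resource, _monitor in alerts:
        if resource not in known_names and resource not in unknown:
            unknown.append(resource)
    return unknown


alerts = parse_raised_alerts(EMS_SAMPLE)
# Assumption: node names plus the remote-device-id values gathered manually.
known = {"node_name1", "node_name2"}
print(unknown_resources(alerts, known))
```

With the sample lines, the two names that appear only in the alerts (no_switch_name1 and no_switch_name2) are reported, while node_name2 is not, because it is a known node.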