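BES-53248 reboots unexpectedly due to a memory leak
Applies to
- BES-53248 cluster switch
- EFOS 3.11.1.2
Issue
- The BES-53248 cluster interconnect switch reboots unexpectedly, with no power outage and no power maintenance having been performed.
- Free memory decreases over a period of 6 to 7 months, eventually resulting in the reboot. The switch log records the reset: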
------------------------ Print file - slog0.txt ------------------------
<185> Jun 4 05:42:17 0.0.0.0-1 SIM[Cnfgr_Thread_]: sim_util.c(4302) 12 %% ALRT Switch was reset due to power disruption or unexpected restart.(error[0x0]).
- The partner switch reports the following "ISL down" alerts:
[Node-01: cshmd: hm.alert.raised:alert]: Alert Id = ClusterIfIslDownWarn_Alert , Alerting Resource = CS2/Slot: 0 Port: 55 100G - Level raised by monitor ethernet-switch
[Node-01: cshmd: hm.alert.raised:alert]: Alert Id = ClusterIfIslDownWarn_Alert , Alerting Resource = CS2/Slot: 0 Port: 56 100G - Level raised by monitor ethernet-switch
[Node-01: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process cshm: ClusterIfIslDownWarn_Alert[CS2/ Link Aggregate 1].
- The connected ONTAP nodes report the following error messages:
Sun Jan 19 05:33:39 +0800 [nodename: kernel: netif.linkDown:info]: Ethernet e0c: Link down, check cable.
Sun Jan 19 05:33:39 +0800 [nodename: cf_main: cf.fsm.takeoverOfPartnerDisabled:error]: Failover monitor: takeover of partnernode disabled (unsynchronized log).
An ASUP is also generated with CLUSTER NETWORK DEGRADED, Reason Takeover not Possible
- After the switch comes back online and the alerts clear, the ISL links come back up.
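
The symptoms above can be checked against collected logs with a short script. The following is a minimal sketch, not a supported tool: it assumes the logs have already been exported as plain text lines, and it matches only on substrings copied from the excerpts above. The function name `scan_logs` and the `SAMPLE` data are hypothetical and introduced here for illustration.

```python
import re

# Substrings taken from the log excerpts in this article. Treating them as
# stable match keys is an assumption; other EFOS/ONTAP releases may word
# these messages differently.
RESET_MARKER = "Switch was reset due to power disruption or unexpected restart"
ISL_DOWN_MARKER = "ClusterIfIslDownWarn_Alert"
PORT_RE = re.compile(r"Slot:\s*(\d+)\s+Port:\s*(\d+)")


def scan_logs(lines):
    """Collect switch reset messages and the (slot, port) of ISL-down alerts."""
    resets = []
    isl_down_ports = []
    for line in lines:
        if RESET_MARKER in line:
            resets.append(line.strip())
        if ISL_DOWN_MARKER in line:
            match = PORT_RE.search(line)
            # Alerts raised against a link aggregate carry no slot/port pair
            # and are skipped here.
            if match:
                isl_down_ports.append((int(match.group(1)), int(match.group(2))))
    return {"resets": resets, "isl_down_ports": isl_down_ports}


# Hypothetical sample built from the excerpts shown in this article.
SAMPLE = [
    "<185> Jun 4 05:42:17 0.0.0.0-1 SIM[Cnfgr_Thread_]: sim_util.c(4302) 12 %% "
    "ALRT Switch was reset due to power disruption or unexpected restart.(error[0x0]).",
    "[Node-01: cshmd: hm.alert.raised:alert]: Alert Id = ClusterIfIslDownWarn_Alert , "
    "Alerting Resource = CS2/Slot: 0 Port: 55 100G - Level raised by monitor ethernet-switch",
    "[Node-01: cshmd: hm.alert.raised:alert]: Alert Id = ClusterIfIslDownWarn_Alert , "
    "Alerting Resource = CS2/Slot: 0 Port: 56 100G - Level raised by monitor ethernet-switch",
]

if __name__ == "__main__":
    result = scan_logs(SAMPLE)
    print(len(result["resets"]), "reset message(s)")
    print("ISL-down ports:", result["isl_down_ports"])
```

A reset message on one switch together with ISL-down alerts on its partner matches the pattern described in this article, but it does not by itself confirm the memory leak as the cause.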