Vifmgr: Packet loss when pinging from one cluster LIF to another cluster LIF
Applies to
- Cluster network switches
- ONTAP 9
Issue
- EMS messages similar to the following are reported for all cluster nodes:
Fri Nov 19 18:06:27 +0100 [node1: vifmgr: vifmgr.cluscheck.ctdpktloss:alert]: Continued packet loss when pinging from cluster lif node1_clus2 (node node1) to cluster lif node5_clus1 (node node5)
Thu Dec 23 03:36:41 +0100 [node2: vifmgr: vifmgr.cluscheck.droppedlarge:alert]: Partial packet loss when pinging from cluster lif node2_clus1 (node node2) to cluster lif node6_clus2 (node node6)
Tue Dec 28 16:54:49 +0100 [node3: vifmgr: vifmgr.cluscheck.droppedall:alert]: Total packet loss when pinging from cluster lif node3_clus2 (node node3) to cluster lif node1_clus1 (node node1)
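- Because many cluster ports report the problem, the symptoms point to a network traffic issue on the inter-switch links (ISLs) between the two cluster switches. Example: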
::> event show -message-name *vifmgr.cluscheck*
Time Node Severity Event
------------------- ---------------- ------------- ---------------------------
8/24/2022 08:14:27 node_name-01 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-01_clus1 (node node_name-01) to cluster lif node_name-11_clus2 (node node_name-11).
8/23/2022 18:36:43 node_name-12 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-12_clus1 (node node_name-12) to cluster lif node_name-11_clus2 (node node_name-11).
8/23/2022 12:41:38 node_name-11 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-01_clus2 (node node_name-01).
8/23/2022 09:33:27 node_name-02 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-02_clus1 (node node_name-02) to cluster lif node_name-11_clus2 (node node_name-11).
8/23/2022 08:28:35 node_name-11 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-12_clus2 (node node_name-12).
8/21/2022 13:58:34 node_name-12 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-12_clus1 (node node_name-12) to cluster lif node_name-01_clus2 (node node_name-01).
8/21/2022 13:36:54 node_name-01 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-01_clus1 (node node_name-01) to cluster lif node_name-11_clus2 (node node_name-11).
8/21/2022 01:51:56 node_name-01 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-01_clus1 (node node_name-01) to cluster lif node_name-12_clus2 (node node_name-12).
8/21/2022 01:08:57 node_name-11 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-01_clus2 (node node_name-01).
8/21/2022 01:08:57 node_name-11 ALERT vifmgr.cluscheck.ctdpktloss: Continued packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-01_clus2 (node node_name-01).
8/20/2022 22:48:56 node_name-11 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-01_clus2 (node node_name-01).
8/20/2022 22:48:56 node_name-11 ALERT vifmgr.cluscheck.ctdpktloss: Continued packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-01_clus2 (node node_name-01).
8/20/2022 22:11:29 node_name-02 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-02_clus1 (node node_name-02) to cluster lif node_name-12_clus2 (node node_name-12).
8/20/2022 10:58:50 node_name-11 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-01_clus2 (node node_name-01).
8/20/2022 01:39:14 node_name-01 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-01_clus1 (node node_name-01) to cluster lif node_name-12_clus2 (node node_name-12).
8/20/2022 01:39:14 node_name-11 ALERT vifmgr.cluscheck.droppedlarge: Partial packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-12_clus2 (node node_name-12).
8/20/2022 01:39:14 node_name-11 ALERT vifmgr.cluscheck.ctdpktloss: Continued packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-12_clus2 (node node_name-12).
8/19/2022 17:29:32 node_name-11 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-12_clus2 (node node_name-12).
8/19/2022 17:29:32 node_name-11 ALERT vifmgr.cluscheck.ctdpktloss: Continued packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-12_clus2 (node node_name-12).
8/18/2022 21:13:36 node_name-11 ALERT vifmgr.cluscheck.droppedall: Total packet loss when pinging from cluster lif node_name-11_clus1 (node node_name-11) to cluster lif node_name-12_clus2 (node node_name-12).
20 entries were displayed.
Cause
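- In the example above, events like those in the Issue section always occur between a _clus1 cluster LIF on one node and a _clus2 cluster LIF on another node, or vice versa
- The _clus1 ports of all nodes are connected to one cluster switch, and the _clus2 ports are connected to the other cluster switch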
Solution
- Disable one ISL port at a time and use the cluster ping to check whether the error messages return. Example:
::> set advanced
::*> cluster ping-cluster
- Once the faulty ISL connection has been identified, check the hardware components specific to that link
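
The pattern described in this article (events always occurring between a _clus1 LIF and a _clus2 LIF) can be checked against saved EMS output with a small script. The following is a minimal Python sketch, not a NetApp tool. It assumes that cluster LIF names end in _clus1 or _clus2 as in the examples, and that all _clus1 ports are cabled to one switch and all _clus2 ports to the other. The names PATTERN, lif_suffix and classify are hypothetical helpers introduced here for illustration.

```python
import re
from collections import Counter

# Matches the "from cluster lif X (node A) to cluster lif Y (node B)" part
# of a vifmgr.cluscheck EMS message, in either output format shown above.
PATTERN = re.compile(
    r"vifmgr\.cluscheck\.(?P<event>\w+).*?"
    r"from cluster lif (?P<src>\S+) \(node (?P<src_node>[^)\s]+)\) "
    r"to cluster lif (?P<dst>\S+) \(node (?P<dst_node>[^)\s]+)\)"
)


def lif_suffix(lif_name):
    """Return the part after the last underscore, e.g. 'clus1'.

    Assumption: LIF names follow the <node>_clus1 / <node>_clus2 convention.
    """
    return lif_name.rsplit("_", 1)[-1]


def classify(lines):
    """Count events whose source and destination LIFs sit on different
    switches (clus1 <-> clus2) versus the same switch."""
    counts = Counter()
    for line in lines:
        match = PATTERN.search(line)
        if match is None:
            continue  # not a vifmgr.cluscheck packet-loss message
        crosses = lif_suffix(match["src"]) != lif_suffix(match["dst"])
        counts["cross-switch" if crosses else "same-switch"] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "Fri Nov 19 18:06:27 +0100 [node1: vifmgr: vifmgr.cluscheck.ctdpktloss:alert]: "
        "Continued packet loss when pinging from cluster lif node1_clus2 (node node1) "
        "to cluster lif node5_clus1 (node node5)",
        "8/24/2022 08:14:27 node_name-01 ALERT vifmgr.cluscheck.droppedall: "
        "Total packet loss when pinging from cluster lif node_name-01_clus1 "
        "(node node_name-01) to cluster lif node_name-11_clus2 (node node_name-11).",
    ]
    print(dict(classify(sample)))
```

If nearly every event is counted as cross-switch, that is consistent with traffic failing on the ISLs. A significant number of same-switch events would suggest looking elsewhere, for example at an individual node port or a single switch.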