因SFP/缆线故障而导致集群互连延迟较长
适用场景
问题描述
- 用户报告NetApp卷延迟较长。
- QoS统计信息报告 集群互连延迟。
cluster1::> qos statistics volume latency show -iterations 100
Workload ID Latency Network Cluster Data Disk QoS NVRAM
--------------- ------ -------- -------- -------- -------- -------- -------- ----------
-total- 2320.45ms 109.00us 2320.02ms 327.00us 32.00us 0ms 0ms
vdisk_vol1 111 2214.05ms 135.00us 2213.22ms 603.00us 89.00us 0ms 0ms
vol2 1234 2701.12ms 238.00us 2700.56ms 191.00us 133.00us 0ms 0ms
- 根据SFP故障的类型、这可能并不总是触发EMS事件
- EMS日志可能会报告以下错误:
Mon Nov 21 08:31:41 +0900 [ST-BNC-5708N2: vifmgr: callhome.clus.net.degraded:alert]: Call home for CLUSTER NETWORK DEGRADED: Large MTU Packet Loss - Ping failures detected between cluster-02_clus2 ( xxx.xxx.xx.xx ) on Node2 and cluster-01_clus2 ( xxx.xxx.xxx.xxx ) on Node1