EMS 事件 "netif.tcp.conn.bad.checksum"
适用于
- ONTAP 9.11.1 及更高版本
- 传输控制协议 (TCP)
- 互联网协议 (IP)
- 网络连接
问题描述
- 多个
netif.tcp.conn.bad.checksum:error事件会显示不同的源地址和目标地址:
[node01: kernel: netif.tcp.conn.bad.checksum:error]: TCP packet with bad checksum detected on port e0c. The packet arrived on connection with source address 10.0.0.10 and destination_address 10.0.0.1[node01: kernel: netif.tcp.conn.bad.checksum:error]: TCP packet with bad checksum detected on port e0c. The packet arrived on connection with source address 10.0.0.11 and destination_address 10.0.0.1[node01: kernel: netif.tcp.conn.bad.checksum:error]: TCP packet with bad checksum detected on port e0c. The packet arrived on connection with source address 10.0.0.12 and destination_address 10.0.0.2[node01: kernel: netif.tcp.conn.bad.checksum:error]: TCP packet with bad checksum detected on port e0c. The packet arrived on connection with source address 10.0.0.10 and destination_address 10.0.0.2注:源地址通常是客户端地址。目的地址是节点的网络接口(LIF)地址
Bad TCP cksum也在节点::> system node run -node node_name -command ifstat port_name的 ifstats 中的端口上递增:
-- interface e0c (40 days, 5 hours, 34 minutes, 37 seconds) --
RECEIVE
Total frames: 2047m | Frames/second: 589 | Total bytes: 2625g
Bytes/second: 755k | Total errors: 0 | Errors/minute: 0
...
LRO bytes: 2527g | LRO6 segments: 0 | LRO6 bytes: 0
Bad UDP cksum: 0 | Bad UDP6 cksum: 0 | Bad TCP cksum: 21
Bad TCP6 cksum: 0 | Mcast v6 solicit: 0 | Lagg errors: 0
...
- 已检查其他步骤:
- 验证以下是否有任何 CRC 错误
ifstat,如果可见 CRC 错误,则对连接到端口的电缆/SFP 进行故障排除 - 如果在
ifstat中未报告 CRC 错误:- 如果在 EMS 消息中报告了单个客户端,则调查客户端
- 否则,如果客户端没有趋势,则调查 EMS 消息中提到的客户端 IP 和 LIF 之间的设备
- 确保在所有接口(存储、交换机、主机)上设置了正确的 MTU
- 验证以下是否有任何 CRC 错误
注意:从存储、连接的交换机端口和客户端的受影响端口同时收集 tcpdumps 或数据包跟踪,以确认 TCP 错误校验和。当前没有其他已知方法来排除贡献者或确定原因