由于两个接口链路断开导致 nodeOffline
适用于
- NetApp SolidFire
- NetApp Element 软件
问题描述
- ActiveIQ 报告以下错误:
nodeOffline The SolidFire Application cannot communicate with Storage node having node ID xx.
incorrectBondPortCount Incorrect bond port counts detected on the sipi interfaces of nodes: {xx}
networkConfig Network interface eth1 is down or cable is unplugged.
mtuCheckFailure Failed to send messages between nodes using the maximum configured MTU (9000). Messages between the following nodes failed: { yy -> xx }. Incorrect MTU settings can prevent garbage collection from running and cause performance problems.
- 节点
kern.log
检测到以下消息:
2025-05-09T06:29:44.831556Z node1 kernel: [56392902.434327] mlx5_core 0000:af:00.1 eth1: Link down
2025-05-09T06:29:44.857556Z node1 kernel: [56392902.460327] mlx5_core 0000:af:00.1 eth1: speed changed to 0 for port eth1
2025-05-09T06:29:44.905496Z node1 kernel: [56392902.508267] Bond10G: link status definitely down for interface eth1, disabling it
2025-05-09T06:29:44.905501Z node1 kernel: [56392902.508272] Bond10G: first active interface up!
2025-05-09T06:42:22.509868Z node1 kernel: [56393660.112638] mlx5_core 0000:af:00.0 eth0: Link down
2025-05-09T06:42:22.812705Z node1 kernel: [56393660.415476] mlx5_core 0000:af:00.0 eth0: speed changed to 0 for port eth0
2025-05-09T06:42:22.825152Z node1 kernel: [56393660.427923] Bond10G: link status definitely down for interface eth0, disabling it
2025-05-09T06:42:22.825159Z node1 kernel: [56393660.427930] Bond10G: now running without any active interface!