由于主机和存储端口上的 MTU 不匹配导致 NFS 数据存储写入问题
适用场景
- NFS
- ESXi
- Linux
- MTU
问题描述
- Linux 挂载的 NFS 共享在写入操作期间挂起,但可以访问。ESXi
- 挂载的 NFS 数据存储可以访问。
- 当两端的 MTU 均设置为 1500 时,不会出现此问题。
- 即使在同一设备上,连接到不同交换机的端口上的 MTU 设置为 9000 也不会出现问题。
- 使用 a0a 端口的 VLAN 时会出现
- 此问题。使用 a0b 端口的 VLAN 时不会出现此问题。
- 所有端口的 MTU 大小均设置为 9000。
- 所有 a0a VLAN 端口(包括成员端口)都会报告 CRC 错误。
vifmgr.cluscheck.crcerrors: Port a0a on node NodeA is reporting a high number of observed hardware errors, possibly CRC errors.
vifmgr.cluscheck.crcerrors: Port a0a-101 on node NodeA is reporting a high number of observed hardware errors, possibly CRC errors.
vifmgr.cluscheck.crcerrors: Port e0e on node NodeA is reporting a high number of observed hardware errors, possibly CRC errors.
vifmgr.cluscheck.crcerrors: Port e0g on node NodeA is reporting a high number of observed hardware errors, possibly CRC errors.
- 可以创建新的虚拟机/存储 vMotion 虚拟机成功,但在传输几 MB 后,传输停止,几分钟后相应任务出现超时错误:
- vmkernel.log:APD 状态间歇性维持
2021-09-28T16:12:33.376Z cpu0:2098712)WARNING: NFS: 337: Lost connection to the server 172.27.143.110 mount point /netapp_test, mounted as 8b08cbd1-435c57cd-0000-000000000000 ("netapp_test")
2021-09-28T16:14:15.781Z cpu0:2098712)NFS: 346: Restored connection to the server 172.27.143.110 mount point /netapp_test, mounted as 8b08cbd1-435c57cd-0000-000000000000 ("netapp_test")
2021-09-28T16:14:15.781Z cpu6:2097603)StorageApdHandler: 507: APD exit event for 0x4314ef9fec20 [8b08cbd1-435c57cd]
2021-09-28T16:14:15.781Z cpu6:2097603)StorageApdHandlerEv: 117: Device or filesystem with identifier [8b08cbd1-435c57cd] has exited the All Paths Down state.