如何解决 VMware ESXi 上的 NFS APD 问题
适用场景
- ONTAP 9
- VMware ESXi
问题描述
- 如果在任何给定TCP流上存在5秒的时间段且未进行通信、则会启动All Paths Down (APD)计时器
- 处于此状态140秒后、连接将被视为丢失、并且达到APD超时
- VMware日志中可能出现的错误 包括但不限于:
YYYY-MM-DD T00:26:51.504Z: [APDCorrelator] xxxxxxxxxxxxxus: [esx.problem.storage.apd.timeout] Device or filesystem with identifier [xxxxxxxx-xxxxxxxx] has entered the All Paths Down Timeout state after being in the All Paths Down state for 140 seconds. I/Os will now be fast failed.
NFSLock: 608: Stop accessing fd 0x410011446d28 3
NFS: 133: Lost connection to the server 192.168.0.1 mount point /vol/datastore, mounted as xxxxxxxxx-xxxxxxxx-0000-000000000000 (“datastore”)
StorageApdHandler: 248: APD Timer started for ident [xxxxxxxx-xxxxxxxx]
StorageApdHandler: 846: APD Start for ident [xxxxxxxx-xxxxxxxx]!
StorageApdHandler: 277: APD Timer killed for ident [xxxxxxxx-xxxxxxxx]
StorageApdHandler: 902: APD Exit