AFF A400报告vifmgr.rpc.nblade.timeouts:由于插槽3集群端口出现致命错误而导致的错误
适用场景
- AFF A400
- ONTAP 9.7—ONTAP 9.7P6
问题描述
- 数据服务可能会受到影响、从而导致客户端和应用程序挂载失败
- Node1的EMS报告EMS和VIFMgr中存在大量与网络相关的错误
Sun Jun 14 23:59:05 -0700 [node1: vifmgr: vifmgr.rpc.nblade.timeouts:error]: The Logical Interface Manager (VIFMgr) is not receiving responses from the nblade.
- 自此之后、我们每天都会看到这些错误。
- 最终导致NFS因系统挂起而中断
- 由于此消息是指VIFMgr、因此我们检查了VIFMgr日志、发现在EMS中出现第一个错误之前、进程已超时。
00000013.00ffb00c 020caea1 Tue Jun 30 2020 08:20:38 -07:00 [kern_vifmgr:info:6191] [0x813410700] [NbladeWriter::nitroPcpRpcCall] clnt_call idemp RPC timeout (elapsed time: 30s)
00000013.00ffb00d 020caea1 Tue Jun 30 2020 08:20:38 -07:00 [kern_vifmgr:info:6191] [0x813410700] [NbladeWriter::reportHungNblade] Nblade has not responded to nitro RPCs for 1326210 seconds
00000013.00ffb0cd 020caffd Tue Jun 30 2020 08:21:08 -07:00 [kern_vifmgr:info:6191] [0x813410700] [NbladeWriter::nitroPcpRpcCall] clnt_call idemp RPC timeout (elapsed time: 60s)
00000013.00ffb0ce 020caffd Tue Jun 30 2020 08:21:08 -07:00 [kern_vifmgr:info:6191] [0x813410700] [NbladeWriter::nitroPcpRpcCall] long-running operation: procNum=35; time=60024 ms
00000013.00ffb0d0 020caffd Tue Jun 30 2020 08:21:08 -07:00 [kern_vifmgr:info:6191] [0x80bf0dc00] [NbladeWriter::reportHungNblade] Nblade has not responded to nitro RPCs for 1326240 seconds