由于 CPU 不足,NFS 服务器无响应和大范围 VM BSOD
适用于
- NFS
- 9.15.1 CLUSTER-MODE
问题描述
- NFS 客户端(RHEL/OpenShift)报告"nfs server not responding, still trying"
- Windows VM 出现蓝屏死机
- EMS/系统日志显示:
Sun Nov 02 01:34:14-0500 [dc1h20502:mgwd:rdb.node.starvation:error]: CPU starvation detected in the RDB.Sun Nov 02 01:29:37-0500 [dc1h20502:ksmf_timeout_thread:ksmf.svc.watchdog:debug]: "kSMF service thread held >25(sec) by application for table ksmfRawZapi"Sun Nov 02 01:30:25-0500 [dc1h20502:kernel:Nblade.nfsLongRunningOp:debug]: Detected a long running network process operation. The client IP address:port is 19.14.190.123:719...Sun Nov 02 01:33:40-0500 [dc1h20502:CCMA-Scheduler:perf.ccma.workQ.overrun:debug]: Performance archiver cannot collect objects in a timely manner, for the 1 seconds period.