基于 RHEL/CentOS Docker 的存储节点上的服务失败
适用于
- StorageGRID
- 基于 RHEL/CentOS Docker 的存储节点
- 基础操作系统 RHEL/CentOS 7.7
问题
- 所有基于 RHEL/CentOS Docker 的存储节点上的许多服务都失败,因此无法登录 GMI ;
- 无法查看资源暂时不可用的存储节点网格服务状态错误:
Command: /usr/local/servermanager/reader.rb
Result:
/usr/local/servermanager/reader.rb:28:in `initialize': can't create Thread: Resource temporarily unavailable (ThreadError)
from /usr/local/servermanager/reader.rb:28:in `new'
from /usr/local/servermanager/reader.rb:28:in `block in current_status'
from /usr/local/servermanager/reader.rb:25:in `each'
from /usr/local/servermanager/reader.rb:25:in `current_status'
from /usr/local/servermanager/reader.rb:97:in `<main>'
- 在 bycast-err.log 中找到有关 Cassandra 无法创建新本机线程的错误:
Oct 27 15:46:11 dc1-sn1 [ERROR] org.apache.cassandra.concurrent.SEPWorker:145 - Failed to execute task, unexpected exception killed worker: unable to create new native thread
Oct 27 15:46:11 dc1-sn1 [ERROR] org.apache.cassandra.transport.Message:734 - Unexpected exception during request; channel = [id: 0x0b8287d4, L:/172.23.60.246:9042 - R:/172.23.60.247:38244]
Oct 27 15:46:11 dc1-sn1 #011java.lang.OutOfMemoryError: unable to create new native thread
- 重新启动存储节点 / 基础操作系统无帮助,重新启动期间,基础操作系统日志 /var/log/storagegRid/nodes/<node-name>.log 中可能会显示资源暂时不可用的错误:
[2020-10-28T23:57:25.115742] INFO -- /usr/bin/initSG.sh: fork: retry: Resource temporarily unavailable
[2020-10-28T23:57:26.115841] INFO -- /usr/bin/initSG.sh: fork: retry: Resource temporarily unavailable
[2020-10-28T23:57:28.116122] INFO -- /usr/bin/initSG.sh: fork: retry: Resource temporarily unavailable