StorageGRID node expansion stuck at "Starting services"
Applies to
NetApp StorageGRID
Issue
- StorageGRID node expansion stalls at:
Starting service
Error: Failed to start node services.

- storagegrid-status reports that the RSM service is stuck in a starting state on the affected expansion Storage Node.

- servermanager.log on the affected Storage Node:
2023-12-01 12:34:56 +0000 | rsm | RSM is not ready because there is no cluster or the cluster has no leader
- bycast.log on the affected Storage Node:
Dec 1 12:34:56 dc1-sn5 rsm[123456]: [cluster.go:407:] WARNING: HTTP status '500' while posting raft join message to 192.168.1.3:18003
Dec 1 12:34:56 dc1-sn5 rsm[123456]: [cluster.go:403:] WARNING: Error 'Post "https://192.168.1.2:18003/v1/raft/join?id=123456": dial tcp 192.168.1.2:18003: connect: connection refused' while posting raft join message to 192.168.1.2:18003
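
To see quickly which peers are rejecting or refusing the raft join, the rsm warnings can be tallied per target address. The following is a minimal sketch, not a NetApp tool: the function name failed_join_peers is hypothetical, and the pattern assumes the warnings always end with "while posting raft join message to <host>:<port>", as in the excerpt above.

```python
import re

# Assumption: each rsm join warning ends with the target as "<host>:<port>".
JOIN_TARGET = re.compile(r"while posting raft join message to (\S+):(\d+)\s*$")


def failed_join_peers(lines):
    """Map each 'host:port' join target to the number of rsm warnings seen for it."""
    counts = {}
    for line in lines:
        # Only consider warnings emitted by the rsm process.
        if "rsm[" not in line or "WARNING" not in line:
            continue
        match = JOIN_TARGET.search(line)
        if match:
            peer = f"{match.group(1)}:{match.group(2)}"
            counts[peer] = counts.get(peer, 0) + 1
    return counts


# The two bycast.log lines from the excerpt above, used as sample input.
SAMPLE = """
Dec 1 12:34:56 dc1-sn5 rsm[123456]: [cluster.go:407:] WARNING: HTTP status '500' while posting raft join message to 192.168.1.3:18003
Dec 1 12:34:56 dc1-sn5 rsm[123456]: [cluster.go:403:] WARNING: Error 'Post "https://192.168.1.2:18003/v1/raft/join?id=123456": dial tcp 192.168.1.2:18003: connect: connection refused' while posting raft join message to 192.168.1.2:18003
""".strip().splitlines()

for peer, count in sorted(failed_join_peers(SAMPLE).items()):
    print(f"{peer}\t{count}")
```

In practice the function could be fed an open file handle for bycast.log instead of SAMPLE. A peer that appears repeatedly with "connection refused" is one whose port 18003 is not accepting connections from the new node.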