存储节点报告错误驱动器发生故障 xDrive(s) 状态为:NodeOffline
适用场景
- NetApp SolidFire存储节点
- NetApp H系列存储节点
- NetApp Element软件
问题描述
以下 NetApp 存储节点之一可能会生成 AutoSupport (ASUP) 案例报告: SFCOMM:SolidFire Alert from <cluster name> (Node Offline) Node Offline nodeID=x
事件日志中显示以下错误代码:
- 错误代码:
driveFailed
|详情:xx drive(s) with state: "NodeOffline" driveID: xx
- 错误代码:
unresponsiveService
|详情:A block service is not responding is reported on each assigned hard drive.
- 错误代码:
unresponsiveService
|详情:A bulk volume service is not responding.
- 错误代码:
driveAvailable
|详情:Node ID xx has xx available drive(s).
- 错误代码:
nodeOffline
|详情:The SolidFire Application cannot communicate with the Storage node having node ID xx.
- 错误代码:
sliceServiceUnhealthy
|详情:A metadata service is unhealthy and SolidFire is attempting to migrate data away from it.
- 错误代码:
blockServiceUnhealthy
|详情:A block service is unhealthy and SolidFire is attempting to migrate data away from it is reported on all drives.
- 节点在不到 (10) 分钟的时间内从离线状态恢复
- 驱动器被暂时标记为故障
注:
- 这可能发生在单个或多个节点上。
- 由于块服务停止,集群主节点无法再与驱动器通信,因此驱动器被标记为“故障”。如果块服务在 5 分半钟内未恢复,驱动器将自动同步,应联系 NetApp 支持人员,以确定是否可以将驱动器重新添加到节点配置中。