StorageGRID 的 HTTP 500 错误和对象不可用
适用于
- NetApp StorageGRID 11.9.0.12
- 基于对象的存储(SW-SG-WEBSCALE-SW)
- 具有强站点一致性的多站点 StorageGRID 部署
- 执行站点/网络隔离测试的环境
问题
在计划的站点隔离测试中,客户在访问特定 S3 存储桶中的对象时观察到大量 HTTP 500 内部服务器错误和对象不可用响应(例如,"s3-prd1",对象:"synchronous-client-access-keys")。尽管存储桶配置了强站点一致性,但仍发生此问题。记录了以下错误和日志条目:
症状和日志输出:
- 隔离事件期间,S3 客户端请求因 HTTP 500 (InternalError) 和 SF 403 错误而失败。
- bycast.log 和相关日志显示:
ERROR OBDR: RefreshLocations CassandraUnavailableException(SELECT ... FROM storagegrid.object_by_uuid WHERE uuid=? - CassandraDriverError(Unavailable): 'Cannot achieve consistency level EACH_QUORUM in DC group' DetailedInfo:[consistency:EACH_QUORUM, responses_received:0, responses_required:2])ERROR OBDR: RefreshLocations CassandraUnavailableException(SELECT ... FROM storagegrid.object_by_uuid WHERE uuid=? - CassandraDriverError(Unavailable): 'Cannot achieve consistency level QUORUM' DetailedInfo:[consistency:QUORUM, responses_received:3, responses_required:4])ERROR S3RQ: S3requesterror(afterresponse): connection=..., Resource=..., HTTPStatusCode(notsent)=500, ErrorMsg=InternalError, Details={asyncPusherror:noconsumer}