vCenter information in SolidFire AIQ is not updated due to hosts that are not responding
Applies to
- NetApp HCI
- NetApp SolidFire Active IQ (AIQ)
- Some hosts registered in vCenter no longer exist
Example: output of dcli com vmware vcenter host list on vCenter
Command> dcli com vmware vcenter host list
|--------|------|----------------|-----------|
|host |name |connection_state|power_state|
|--------|------|----------------|-----------|
|host-xxx|host01|NOT_RESPONDING | |
|host-xxx|host02|NOT_RESPONDING | |
|host-xxx|host03|CONNECTED |POWERED_ON |
|host-xxx|host04|CONNECTED |POWERED_ON |
|--------|------|----------------|-----------|
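To pick out the affected hosts from a longer listing, the table can be parsed with a few lines of Python. This is a minimal sketch that assumes the output has been saved as text in the pipe-delimited layout shown above. The function names and the SAMPLE variable are hypothetical and are not part of dcli or any NetApp tool.

```python
def parse_dcli_table(text):
    """Parse the pipe-delimited table printed by dcli into a list of dicts."""
    header = None
    rows = []
    for line in text.splitlines():
        line = line.strip()
        # Skip the prompt line, blank lines, and |----| separator rows.
        if not (line.startswith("|") and line.endswith("|")):
            continue
        if set(line) <= set("|-"):
            continue
        # Drop the outer pipes, then split the remaining cells.
        cells = [cell.strip() for cell in line[1:-1].split("|")]
        if header is None:
            header = cells
        else:
            rows.append(dict(zip(header, cells)))
    return rows


def not_responding_hosts(text):
    """Return the names of hosts whose connection_state is NOT_RESPONDING."""
    return [row["name"] for row in parse_dcli_table(text)
            if row.get("connection_state") == "NOT_RESPONDING"]


# Sample copied from the example output in this article.
SAMPLE = """\
Command> dcli com vmware vcenter host list
|--------|------|----------------|-----------|
|host |name |connection_state|power_state|
|--------|------|----------------|-----------|
|host-xxx|host01|NOT_RESPONDING | |
|host-xxx|host02|NOT_RESPONDING | |
|host-xxx|host03|CONNECTED |POWERED_ON |
|host-xxx|host04|CONNECTED |POWERED_ON |
|--------|------|----------------|-----------|
"""

print(not_responding_hosts(SAMPLE))  # ['host01', 'host02']
```

Matching on the header names rather than on column positions keeps the sketch working if the columns are printed in a different order.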
Issue
- The vCenter and compute node information in SolidFire AIQ is not updated
- mnode_hci-monitor.log on the management node (mNode) shows errors:
get_nma_and_mnode_stats-directive-monitor-<vCenter-UUID>:[sf.mon.aiq:post_data:245]DEBUG:Published data to AIQ. Response [400]
get_nma_and_mnode_stats-directive-monitor-<vCenter-UUID>:[sf.mon.aiq:post_data:247]ERROR:Failed to send support data to AIQ. HTTP response code [400]
SF-VCAlarm-Monitor:[sf.mon.mediator:get_monitor_pairs:104]ERROR:Exception while speaking to dispatcher: 503 Service Unavailable
- When the 503 Service Unavailable error occurs, vpxd.log on vCenter shows errors:
error vpxd[21325] [Originator@6876 sub=Vmomi opID=492a9a9] Caught exception while sending activation result; <<xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx, <TCP '127.0.0.1 : 8085'>, <TCP '127.0.0.1 : 48066'>>, storageSystem-xxx, vim.host.StorageSystem.GetFileSystemVolumeInfo>, N5Vmomi5Fault11SystemError9ExceptionE(Fault cause: vmodl.fault.SystemError
--> )
--> [context]<CONTEXT>[/context]
error vpxd[21325] [Originator@6876 sub=Http2Session #2 opID=492a9a9] [Stream #70042349] Transaction was destroyed before completing in state: 1; The handler probably needs to be fixed to always complete. Now resetting the stream...
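To check whether a given pair of logs shows the same signatures, the error lines can be matched with regular expressions. This is a minimal sketch under stated assumptions: the patterns are derived only from the sample lines in this article, the position of the managed object and method in the vpxd.log line is inferred from that single sample, and all function and variable names are hypothetical. Reading the actual log files is left out because their paths are not given here.

```python
import re

# Patterns copied from the sample log lines in this article.
AIQ_POST_FAILURE = re.compile(
    r"Failed to send support data to AIQ\. HTTP response code \[(\d{3})\]"
)
DISPATCHER_FAILURE = re.compile(
    r"Exception while speaking to dispatcher: (\d{3})"
)
# Assumption: the managed object and method follow the connection tuple.
ACTIVATION_FAILURE = re.compile(
    r"Caught exception while sending activation result;"
    r".*?>>,\s*([^,]+),\s*([\w.]+)>"
)


def summarize_monitor_log(lines):
    """Count AIQ upload and dispatcher failures per HTTP status code."""
    counts = {}
    for line in lines:
        for label, pattern in (("aiq_post", AIQ_POST_FAILURE),
                               ("dispatcher", DISPATCHER_FAILURE)):
            match = pattern.search(line)
            if match:
                key = (label, int(match.group(1)))
                counts[key] = counts.get(key, 0) + 1
    return counts


def failed_activations(lines):
    """Return (managed_object, method) pairs from vpxd.log activation errors."""
    found = []
    for line in lines:
        match = ACTIVATION_FAILURE.search(line)
        if match:
            found.append((match.group(1).strip(), match.group(2)))
    return found


MONITOR_SAMPLE = [
    "get_nma_and_mnode_stats-directive-monitor-<vCenter-UUID>:[sf.mon.aiq:post_data:245]DEBUG:Published data to AIQ. Response [400]",
    "get_nma_and_mnode_stats-directive-monitor-<vCenter-UUID>:[sf.mon.aiq:post_data:247]ERROR:Failed to send support data to AIQ. HTTP response code [400]",
    "SF-VCAlarm-Monitor:[sf.mon.mediator:get_monitor_pairs:104]ERROR:Exception while speaking to dispatcher: 503 Service Unavailable",
]
VPXD_SAMPLE = [
    "error vpxd[21325] [Originator@6876 sub=Vmomi opID=492a9a9] Caught exception while sending activation result; "
    "<<xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx, <TCP '127.0.0.1 : 8085'>, <TCP '127.0.0.1 : 48066'>>, "
    "storageSystem-xxx, vim.host.StorageSystem.GetFileSystemVolumeInfo>, "
    "N5Vmomi5Fault11SystemError9ExceptionE(Fault cause: vmodl.fault.SystemError",
]

print(summarize_monitor_log(MONITOR_SAMPLE))
print(failed_activations(VPXD_SAMPLE))
```

Counting per status code separates the 400 responses from AIQ from the 503 responses from the dispatcher, which are the two distinct failures described in the list above.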