跳转到主内容

为什么我们在 Active IQ 中未发现明显错误的情况下收到电子邮件通知?

Views:
Visibility:
Public
Votes:
0
Category:
element-software
Specialty:
solidfire
Last Updated:

适用场景

  • NetApp Element 软件
  • NetApp SolidFire Active IQ :

问题解答

当集群主迁移同时发生集群警报时,警报可能不会立即发送到 Active IQ 。尤其是当警报与释放的集群主节点相关时。警报将保持陈旧状态,直到集群主节点下次迁移并擦除所有警报时为止,即使警报在此期间已解决也是如此。
 
已解决和未解决的警报仍将视为 " 新警报 " ,并且无论在下次迁移集群主节点后是否发出通知,都会发送电子邮件。
 
  • 要在 Active IQ 中查找警报,请转到 " 报告 ">" 错误 "
    • 对于未解决的警报:按 日期排序
    • 对于已解决的警报:按 解决时间排序
    • 或者,也可以筛选 电子邮件中的警报 ID
       
  • 要确定集群主迁移,请执行以下操作:
    • 在 Active IQ 中,转至 "Reporting">"Events"
    • clusterMasterEvent 的筛选器
    • 注意: 事件列表仅跟踪最近 10 , 000 个事件。如果已被覆盖, NetApp 支持部门仍可从存储日志中及时进一步跟踪。

请记住,本文仅介绍延迟电子邮件的行为,而不介绍警报的来源。触发警报的因素可能仍需要进一步调查。

本文还仅与孤立的实例相关。如果警报电子邮件仍重复出现,请查看SolidFire 集群中已解决的重复警报

追加信息

通知邮件示例:

Alert ID: <#>
Severity: error 
Cluster: <CLUSTER NAME>
Occurrence Time: 2021-07-06 15:42:30 UTC
Notification Time: 2021-07-21 19:51:51 UTC
  ->>> 时间戳之间的显著差异

clusterFaultID: 102
Additional Detail:

clusterFaultID: 102
nodeHardwareFaultID: 593
code: nodeOffline
details: The SolidFire Application cannot communicate with Storage node having node ID 1.
severity: error
date: 2021-07-06T15:42:53.164604Z
resolved: true
type: node
nodeID: 1

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.