在具有静态阈值的ONTAP 版本中报告可更正的内存错误
适用场景
ONTAP版本:
- 9.1P17及更早版本的P
- 9.2所有P版本
- 9.3P10及更早版本的P
- 9.4P5及更早的P版本
平台:
- AFF A800
- AFF A700
- AFF A700/FAS9000
- AFF A300/FAS8200
- AFF A220/FAS27x0
- AFF A200/FAS26x0
- AFF80或FAS80
注意:对于所有其他ONTAP平台和ONTAP版本,请参见: How to recon的memory errors on FAS and AFF systems
问题描述
- 节点报告可更正的ECC错误:
event log show -event *cecc*
Sun Nov 11 08:00:52 GMT [ClusterA-01: idle_thread0: cecc_log_summary_1:warning]: params: {'total_num_ceccs': '56', 'num_ceccs': '3'}
Sun Nov 11 08:18:40 CST [cecc_log.summary:warning]: Total of 303 new correctable ECC errors just reported. You might want to check system memory. 12828 correctable ECC errors reported since booting.
- show-memory-errors-Errors报告同一DIMM上存在多个CECC错误
ClusterA::*>set advanced
ClusterA::*>system node show-memory-errors
Correctable ECC Memory Errors:
Node: ClusterA-01
DIMM CECC Multiple Err
Name Count Same Address
------- ------ ------------
DIMM-1 0 false
DIMM-2 22 false
DIMM-3 0 false
Node: ClusterA-02
DIMM CECC Multiple Err
Name Count Same Address
------- ------ ------------
DIMM-1 0 false
DIMM-2 0 false
DIMM-3 0 false
6 entries were displayed.
- 可能会触发AutoSupport警报
CriticalCECCCountMemErrAlert