从 NS224 盘架的 NSM100 上检测到 DIMM 错误
适用于
- ONTAP 9
- NS224 机架
- NSM100
- NSM100B
问题
- 有关 NSM 模块上 DIMM 故障的事件日志中报告以下错误:
[Node-01: dsa_worker3: ses.status.dimm.error:error]: NS224NSM100 (S/N SHFGBXXXXXXXX) shelf 0 on channel 0x DIMM failure for Dimm Element 8: not installed or failed. This element is on the DIMM slot 4 in the bottom shelf module (B).
[Node-01: dsa_worker3: ses.status.dimm.threshErr:alert]: NS224NSM100B shelf 1 on channel 0x DIMM threshold failure error for Dimm Element 3: critical status. This element is on the DIMM slot 1 in the bottom shelf module (B).
cluster::> storage shelf show -instanceDIMM:ID Mod Type Size Speed Status Location--- --- ---- ---- ------- ------------ -------------------------------1 A DIMM 8GB 2666Mhz normal DIMM slot 1 in the top shelf module (A)2 A DIMM 8GB 2666Mhz normal DIMM slot 2 in the top shelf module (A)3 A DIMM 8GB 2666Mhz normal DIMM slot 3 in the top shelf module (A)4 A DIMM 8GB 2666Mhz normal DIMM slot 4 in the top shelf module (A)5 B DIMM 8GB 2666Mhz normal DIMM slot 1 in the bottom shelf module (B)6 B DIMM 8GB 2666Mhz normal DIMM slot 2 in the bottom shelf module (B)7 B DIMM 8GB 2666Mhz normal DIMM slot 3 in the bottom shelf module (B)8 B error DIMM slot 4 in the bottom shelf module (B)- 从
storage show fault输出中,状态报告为未知或严重:
cluster::> system node run -node <node> storage show fault
DIMM:
Element Status Status Bytes Status Descriptions
1 [NSM100 A] : OK 01,00,00,00
2 [NSM100 A] : OK 01,00,00,00
3 [NSM100 A] : OK 01,00,00,00
4 [NSM100 A] : OK 01,00,00,00
5 [NSM100 B] : OK 01,00,00,00
6 [NSM100 B] : OK 01,00,00,00
7 [NSM100 B] : OK 06,00,00,00
8 [NSM100 B] : UNKNOWN 01,00,00,00
DIMM:Element Status Status Bytes Status Descriptions 8 [NSM100 B] : NOT AVAILABLE 07,00,00,00
DIMM:Element Status Status Bytes Status Descriptions 3 [NSM100B B] : CRITICAL 02,00,00,01
- 您可能会看到磁盘冗余失败警报