在同一抽屉中丢失多个磁盘后聚合失败
适用于
- DS460C
- 多个磁盘故障/缺少磁盘
问题描述
- DS460C 上重新拔插 IOM A 后出现多磁盘死机
- 由于同一个货架抽屉缺少磁盘,聚合似乎失败。
system node run -node node_name -command sysconfig -r示例:
Aggregate aggr_n02_data01 (failed, raid_tec, partial) (block checksums) Plex /aggr_n02_data01/plex0 (offline, failed, inactive) RAID group /aggr_n02_data01/plex0/rg0 (partial, block checksums) RAID Disk Device HA SHELF BAY CHAN Pool Type RPM Used (MB/blks) Phys (MB/blks) --------- ------ ------------- ---- ---- ---- ----- -------------- -------------- tparity FAILED N/A 9324290/ - dparity 9c.13.6 9c 13 6 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 parity 9a.11.6 9a 11 6 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9c.12.6 9c 12 6 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.10.6 9a 10 6 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.10.30 9a 10 30 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 (reconstruct stalled) data 9c.13.7 9c 13 7 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.11.7 9a 11 7 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9c.12.7 9c 12 7 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.10.7 9a 10 7 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data FAILED N/A 9324290/ - data 9c.13.8 9c 13 8 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.11.8 9a 11 8 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9c.12.8 9c 12 8 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.10.8 9a 10 8 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data FAILED N/A 9324290/ - data 9c.13.9 9c 13 9 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.11.9 9a 11 9 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9c.12.9 9c 12 9 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.10.9 9a 10 9 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data FAILED N/A 9324290/ - data 9c.13.10 9c 13 10 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.11.10 9a 11 10 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9c.12.10 9c 12 10 SA:A 0 FSAS 7200 9324290/19096145920 9342976/19134414848 data 9a.10.10 9a 10 10 SA:B 0 FSAS 7200 9324290/19096145920 9342976/19134414848 Raid group is missing 4 disks.
- 机架中通道 B 上的 DCM 存在问题,导致在重新拔插过程中没有通往单个抽屉中磁盘的路径
Drawers Control Module:
Element Status Status Bytes Status Descriptions
1 [IOM12 A] : OK 01,00,01,00 REPORT
2 [IOM12 A] : OK 01,00,01,00 REPORT
3 [IOM12 A] : OK 01,00,01,00 REPORT
4 [IOM12 A] : OK 01,00,01,00 REPORT
5 [IOM12 A] : OK 01,00,01,00 REPORT
6 [IOM12 B] : CRITICAL 02,40,00,00 FAIL
7 [IOM12 B] : OK 01,00,01,00 REPORT
8 [IOM12 B] : OK 01,00,01,00 REPORT
9 [IOM12 B] : OK 01,00,01,00 REPORT
10 [IOM12 B] : OK 01,00,01,00 REPORT- 重新插入 IOM A 后,聚合仍然失败
::> storage aggregate show -fields state
aggregate state
--------------- ------
aggr_n01_data01 online
aggr_n01_data02 online
aggr_n01_data03 online
aggr_n01_root online
aggr_n02_data01 failed