单个磁盘被两个 IOM 模块绕过且存在关键盘架故障
适用于
- AFF 系统
- FAS 系统
问题描述
磁盘被两个 IOM 绕过:
Thu Aug 03 02:01:10 +0900 [Nodename: dsa_worker5: ds.sas.drivephy.disableErr:error]: Disk [NETAPP :X380_STATE10TA07:NA00] S/N [ABCDE12345678] on channels 0b shelf ID 1 IOM A bay 10 disabled due to excessive phy reset problems.
Thu Aug 03 02:01:19 +0900 [Nodename: dsa_worker2: ds.sas.drivephy.disableErr:error]: Disk [NETAPP :X380_STATE10TA07:NA00] S/N [ABCDE12345678] on channels 0a shelf ID 1 IOM B bay 10 disabled due to excessive phy reset problems.- 升级机架固件后出现严重机架故障:
::> storage shelf show
Module
Shelf Name Shelf ID Serial Number Model Type Status
----------------- -------- -------------------------- ------ -----------
1.10 10 ABCDE12345678 DS460-12 IOM12B Critical
- 从
storage show fault命令的输出中可以看出以下错误,可以看出 IOM 模块 A 和 B 都绕过了单个磁盘:
Enclosure Status: critical
Channel: 0a
Shelf: 10
Shelf Type: DS460-12
Product Serial Number: ABCDE12345678
Module Type: IOM12B
Disk Elements:
Element Status Status Bytes Status Descriptions
0 [Bay 0]: OK 01,00,00,00
1 [Bay 1]: OK 01,01,00,00
2 [Bay 2]: OK 01,02,00,00
3 [Bay 3]: OK 01,03,00,00
4 [Bay 4]: OK 01,04,00,00
5 [Bay 5]: OK 01,05,00,00
6 [Bay 6]: OK 01,06,00,00
7 [Bay 7]: OK 01,07,00,00
8 [Bay 8]: OK 01,08,00,00
9 [Bay 9]: OK 01,09,00,00
10 [Bay 10]: OK 01,0A,00,00
11 [Bay 11]: OK 01,0B,00,00
12 [Bay 0]: OK 01,00,00,00
13 [Bay 1]: OK 01,01,00,00
14 [Bay 2]: OK 01,02,00,00
15 [Bay 3]: OK 01,03,00,00
16 [Bay 4]: OK 01,04,00,00
17 [Bay 5]: OK 01,05,00,00
18 [Bay 6]: OK 01,06,00,00
19 [Bay 7]: OK 01,07,00,00
20 [Bay 8]: OK 01,08,00,00
21 [Bay 9]: OK 01,09,00,00
22 [Bay 10]: CRITICAL 02,0A,30,2C ENCLOSURE BYPASSED B, ENCLOSURE BYPASSED A, BYPASSED B, BYPASSED A, FAULT REQSTD
- 或者 DISK(16.10)可能不再显示在
sysconfig-a和sysconfig-d中。
sysconfig-a 16.9 : NETAPP X343_TA15E1T8A10 NA01 1713.5GB 520B/sect () 16.11: NETAPP X343_TA15E1T8A10 NA01 1713.5GB 520B/sect ()
sysconfig-d 0c.16.9 0c 16 9 SA:A *** 0c.16.11 0c 16 11 SA:A ***