MetroCluster FC多个磁盘出现故障
适用场景
- MetroCluster FC
- ONTAP 9
- ATTO FibreBridge 7500N/7600N
问题描述
- 由于命令中止的错误过多以及设备超时、多个磁盘出现故障。
- 命令中止的错误和设备超时会出现在同一路径上的磁盘上:
[cl01-n02: isp2400_intrd: scsi.cmd.abortedByHost:error]: Disk deviceswitch1:8.126L29: Command aborted by host adapter: HA status 0x4: cdb 0x9a:0000000014345600:0001:0168.
[cl01-n02: isp2400_intrd: scsi.cmd.abortedByHost:error]: Disk deviceswitch1:8.126L30: Command aborted by host adapter: HA status 0x4: cdb 0x9a:000000003d12d800:0001:0200.
[cl01-n02: isp2400_intrd: scsi.cmd.abortedByHost:error]: Disk deviceswitch1:8.126L38: Command aborted by host adapter: HA status 0x4: cdb 0x9a:000000018ef51200:0001:0200.
[cl01-n02: isp2400_timeout_3: fci.device.timeout:debug]: HBA 1a encountered a device timeout on Disk deviceswitch1:8.126 (0x04070800) LUN 29 cdb 0x9a:0000000014345600:0001:0168 retry: 0
[cl01-n02: disk_server_0: shm.threshold.consecutiveTimeouts:error]: shm: Disk deviceswitch1:8.126L29 has exceeded the threshold of 11 consecutive timeouts; the system will fail the disk if possible.
porterrshow
sfpshow
交换机端口的和输出未显示错误、并且SFP TX/RX值处于正常限制范围内- 连接到错误中所述交换机端口的ATTO FibreBridge日志显示以下错误的多个实例:
INFO FC TM Cmd Rcvd: Abort Task Set to LUN:X on FC Port 1
- 已更换FibreBridge和交换机端口中的SFP、但仍可看到光缆、但磁盘和FibreBridge错误。