跳转到主内容

远程驱动器在 DWDM 维护后出现故障

Views:
4
Visibility:
Public
Votes:
0
Category:
metrocluster
Specialty:
metrocluster
Last Updated:

适用场景

  • MetroCluster IP
  • ONTAP 9

问题描述

将从 MetroCluster IP 交换机 ISL 端口连接到 DWDM 的缆线更换为:
  • 在 EMS 日志中观察到错误
  • 报告远程驱动器出现故障
  • SyncMirror 丛出现故障,多个磁盘缺少 AutoSupport
 
示例:
  • EMS 日志:

Mon Nov 08 12:32:07 +0100 [ClusterA-02: kernel: iscsi.session.stateChanged:notice]: iSCSI session state is changed to Reconnecting for the target iqn.2016-07.com.netapp:  (type: dr_auxiliary, address: 0.0.0.0:65200). Reason: no ping reply after 5 seconds.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: intr: ctl.session.stateChanged:notice]: iSCSI CAM target layer's session state is changed to terminated for the initiator iqn.1994-09.org.freebsd:  (address: 0.0.0.0). Reason: no ping reply after 5 seconds.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: kernel: iscsi.session.stateChanged:notice]: iSCSI session state is changed to Reconnecting for the target iqn.2016-06.com.netapp:  (type: dr_auxiliary, address: 0.0.0.0:65200). Reason: no ping reply after 5 seconds.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: doneq0: scsi.mcc.adt.ioTransportError:error]: mcc_adt[1] - Transport error during execution of command: HA status 0x13: CAM transport status 0x1b : cdb 0x88:00000001bf178980:00000008.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: doneq0: scsi.mcc.adt.ioTransportError:error]: mcc_adt[1] - Transport error during execution of command: HA status 0x13: CAM transport status 0x1b : cdb 0x88:00000001bf178190:00000008.
Mon Nov 08 12:32:07 +0100 [ClusterA-02: scsi_cmdblk_strthr_admin: scsi.cmd.abortedByHost:error]: Disk device 0v.i2.1L21: Command aborted by host adapter: HA status 0x13: cdb 0x88:00000001bf178980:00000008. 

  • sysconfig -r 输出中所示的顺序相同。
 Plex /ClusterA-02/plex1 (offline, failed, inactive, pool1) RAID group /ClusterA-02/plex1/rg0 (partial, block checksums) RAID Disk Device HA SHELF BAY CHAN Pool Type RPM Used (MB/blks) Phys (MB/blks) --------- ------ ------------- ---- ---- ---- ----- -------------- -------------- dparity 0m.i2.3L13P1 0m 20 12 1 SSD N/A 1799343/3685054464 1799351/3685070848 (fast zeroed) parity 0m.i1.0L14P1 0m 20 13 1 SSD N/A 1799343/3685054464 1799351/3685070848 (fast zeroed) data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - data FAILED N/A 1799343/ - Raid group is missing 5 disks.

Sign in to view the entire content of this KB article.

New to NetApp?

Learn more about our award-winning Support

NetApp provides no representations or warranties regarding the accuracy or reliability or serviceability of any information or recommendations provided in this publication or with respect to any results that may be obtained by the use of the information or observance of any recommendations provided herein. The information in this document is distributed AS IS and the use of this information or the implementation of any recommendations or techniques herein is a customer's responsibility and depends on the customer's ability to evaluate and integrate them into the customer's operational environment. This document and the information contained herein may be used solely in connection with the NetApp products discussed in this document.