
Hosts using FC LUNs report "IO operation error"

Category:
fas-systems
Specialty:
SAN

Applies to

  • ONTAP
  • FC
  • Brocade Fabric OS version 7.4.2D
  • Windows
  • AIX

Issue

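  • IO operation Error 153 is reported on the Windows host and Disk Operation Error on the AIX host. Paths and LUNs become inaccessible for a few minutes and then recover automatically.
    • Windows host: disk stall errors are generated on the host, IO lag is observed, and the host cannot read from or write to the disk.
    • AIX host: path changes are reported on the host, and disks are inaccessible on the paths that report the disk operation error.
  • The storage reports IO WQE errors with extended status 0x2 or 0x1d on the same FC port on the storage side.
  • Both hosts access the storage through the same FC port.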

Tue Nov 30 20:17:47 +07 [NetApp: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxx0, VPI: 275, OX_ID: B63, Status 0x3 Ext_Status 0x1d
Mon Sep 26 13:26:52 +07 [NetApp: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxy0, VPI: 275, OX_ID: 8AD, Status 0x3 Ext_Status 0x1d
Tue Jan 03 16:36:22 +07 [NetApp: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxx0, VPI: 275, OX_ID: DA3, Status 0x3 Ext_Status 0x2 
Fri Jan 13 12:05:37 +07 [NetApp: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxy0, VPI: 275, OX_ID: 459, Status 0x3 Ext_Status 0x1d 

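  • Apart from the IO WQE errors, the storage side shows no other error events.
  • On the Brocade switch, a remote E-port had been manually disabled and too many zones were associated with the port; the IO WQE errors were then seen on the storage side.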

switchshow        :
switchName:     XXXXY
switchType:     62.0
switchState:    Online
switchMode:     Native
switchRole:     Principal
switchDomain:   102
Index Slot Port Address Media  Speed        State    Proto
============================================================
150    2   22   661400   id    8G         No_Light    FC  LS Disabled (Persistent)

  • The ELP and EFP entries in the fabriclog output on the Brocade switch show traces of port transitions and rejects:

00:19:55.386609 *ELP Snd ACC:rev=2,flow=1,flen=80,sz=164 .. F0,P1  F0,P2  166   0x8937
00:19:55.386610 op_mode=0x5580                F0,P1  F0,P2  166   0x8937
00:19:55.386803 BF ACC Rcv                  F0,P3  F0,P3  318   0x6ad3
00:19:55.386852 SCN Domain 102 invalid            F0,NA  F0,NA  NA   NA
00:19:55.386877 ELP RJT Rcv - ct prfrm,in prgs,vu=0      F0,P2  F0,P2  166   0x6ad4
00:19:55.391604 SCN Port Offline;g=0x1c            F0,P2  F0,P0  150   NA
00:19:55.391611 *Removing all nodes from port         F0,P0  F0,P0  150   NA
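A longer fabriclog capture can be hard to scan by eye. The following is a minimal sketch, not a Brocade tool, that flags the kinds of entries shown in the excerpt above. The keyword list is taken only from the strings visible in that excerpt, and the function name `find_events` is hypothetical; it makes no assumption about the meaning of the remaining columns.

```python
import re

# A fabriclog line starts with a timestamp, optionally followed by '*'.
LINE_RE = re.compile(r"^(?P<ts>\d{2}:\d{2}:\d{2}\.\d+)\s+\*?(?P<msg>.+)$")

# Substrings copied from the excerpt above (assumption: extend as needed).
KEYWORDS = ("RJT", "Port Offline", "invalid", "Removing all nodes")


def find_events(lines):
    """Return (timestamp, keyword) for each line containing a keyword."""
    events = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue
        for kw in KEYWORDS:
            if kw in m.group("msg"):
                events.append((m.group("ts"), kw))
                break  # report each line once
    return events


sample = [
    "00:19:55.386609 *ELP Snd ACC:rev=2,flow=1,flen=80,sz=164 .. F0,P1  F0,P2  166   0x8937",
    "00:19:55.386610 op_mode=0x5580                F0,P1  F0,P2  166   0x8937",
    "00:19:55.386803 BF ACC Rcv                  F0,P3  F0,P3  318   0x6ad3",
    "00:19:55.386852 SCN Domain 102 invalid            F0,NA  F0,NA  NA   NA",
    "00:19:55.386877 ELP RJT Rcv - ct prfrm,in prgs,vu=0      F0,P2  F0,P2  166   0x6ad4",
    "00:19:55.391604 SCN Port Offline;g=0x1c            F0,P2  F0,P0  150   NA",
    "00:19:55.391611 *Removing all nodes from port         F0,P0  F0,P0  150   NA",
]

for ts, kw in find_events(sample):
    print(ts, kw)
```

Run against the seven lines above, the sketch reports four entries: the invalid domain SCN, the ELP reject, the port going offline, and the node removal.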

 

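Before proceeding, consider the following
  • If the FOS version is old, consider upgrading. Upgrading FOS helps fix many known bugs and issues in the fabric.
  • If too many zones are connected to a large number of devices and any problem occurs on an inter-switch link, out-of-order frames are likely to result, which is what is observed here.
  • Clean up unnecessary zones and keep only the ones that are required.
  • NetApp recommends 1:1 zoning. Make sure the AIX host and the Windows host are not zoned to the same target port.
  • Extended status values 0x02 and 0x1d both indicate an out-of-order FC frame sequence, which usually points to a problem in the fabric.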
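To see which adapter and initiator (S_ID) combinations are affected, the `fcp.io.status` lines can be tallied by extended status. The sketch below is illustrative only: the regular expression is written against the four log lines quoted in this article, and the helper name `count_out_of_order` is hypothetical. Other ONTAP releases may format the message differently.

```python
import re
from collections import Counter

# Pattern written against the fcp.io.status lines quoted in this article.
WQE_RE = re.compile(
    r"Adapter:(?P<adapter>\w+) IO WQE failure.*?"
    r"S_ID: (?P<s_id>\w+),.*?"
    r"Status (?P<status>0x[0-9a-fA-F]+) Ext_Status (?P<ext>0x[0-9a-fA-F]+)"
)

# Extended status values this article associates with out-of-order frames.
OUT_OF_ORDER_EXT = {0x02, 0x1D}


def count_out_of_order(lines):
    """Count out-of-order WQE failures per (adapter, S_ID)."""
    counts = Counter()
    for line in lines:
        m = WQE_RE.search(line)
        if m and int(m.group("ext"), 16) in OUT_OF_ORDER_EXT:
            counts[(m.group("adapter"), m.group("s_id"))] += 1
    return counts


# Message portions of the four log lines shown earlier in this article.
sample = [
    "STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxx0, VPI: 275, OX_ID: B63, Status 0x3 Ext_Status 0x1d",
    "STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxy0, VPI: 275, OX_ID: 8AD, Status 0x3 Ext_Status 0x1d",
    "STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxx0, VPI: 275, OX_ID: DA3, Status 0x3 Ext_Status 0x2",
    "STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 79Dxy0, VPI: 275, OX_ID: 459, Status 0x3 Ext_Status 0x1d",
]

for (adapter, s_id), n in sorted(count_out_of_order(sample).items()):
    print(f"adapter {adapter} S_ID {s_id}: {n} out-of-order WQE failure(s)")
```

For the sample lines, both initiators show two failures each on adapter 2b, which matches the observation that both hosts share one target port.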
