ONTAP FC port shows a low receive power (RX) value

Category:
fabric-interconnect-and-management-switches
Specialty:
san

Applies to

  • ONTAP
  • FCP
  • Brocade SAN switches
  • Cisco SAN switches

Issue

  • The FC port shows a low RX value in ONTAP, according to the fcp adapter show output:
::> fcp adapter show -instance -node nodename -adapter 0a
[...]
Received Optical Power: 241.3 (uWatts)
Is Received Power In Range: false
SFP Transmitted Optical Power: 600.4 (uWatts)
Is Xmit Power In Range: true
DDM Status: 30
Is Xmit Disabled: false
Is Xmit In Fault: false
Is Receiver In LOS: false
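
As a rough aid for reading this output, the sketch below parses the two optical power fields and converts microwatts to dBm, the unit that switch-side tools usually print. The function names (`uw_to_dbm`, `parse_fcp_adapter`) are illustrative and not part of ONTAP; the field labels and values are copied from the sample output above.

```python
import math

# Fields copied from the `fcp adapter show -instance` sample above.
SAMPLE = """\
Received Optical Power: 241.3 (uWatts)
Is Received Power In Range: false
SFP Transmitted Optical Power: 600.4 (uWatts)
Is Xmit Power In Range: true
"""


def uw_to_dbm(microwatts: float) -> float:
    """Convert optical power from microwatts to dBm (0 dBm = 1 mW = 1000 uW)."""
    return 10 * math.log10(microwatts / 1000.0)


def parse_fcp_adapter(text: str) -> dict:
    """Extract RX/TX power and the in-range flags from pasted output (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    rx_uw = float(fields["Received Optical Power"].split()[0])
    tx_uw = float(fields["SFP Transmitted Optical Power"].split()[0])
    return {
        "rx_uw": rx_uw,
        "rx_dbm": uw_to_dbm(rx_uw),
        "rx_in_range": fields["Is Received Power In Range"] == "true",
        "tx_uw": tx_uw,
        "tx_dbm": uw_to_dbm(tx_uw),
        "tx_in_range": fields["Is Xmit Power In Range"] == "true",
    }


if __name__ == "__main__":
    info = parse_fcp_adapter(SAMPLE)
    print(f"RX {info['rx_uw']} uW = {info['rx_dbm']:.2f} dBm, in range: {info['rx_in_range']}")
    print(f"TX {info['tx_uw']} uW = {info['tx_dbm']:.2f} dBm, in range: {info['tx_in_range']}")
```

The helper expects one field per line, so output pasted as a single wrapped line would need to be split first.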
  • The FCP-Adapter.xml logs in ONTAP report low Rx power, indicating that the problem is upstream of ONTAP:
[Image: FCP-Adapter.xml excerpt]
 
  • The following risk is reported in AIQM:
    • RISK: SFP receive power of the SFP connected to netapp port 9c is below the recommended value.


  • The SFPs on both ends are healthy. Tx and Rx are within the recommended range on both ends.

  • In the EMS logs, "link break detected" and "link up" messages are seen, along with AEN errors:

Wed Apr 12 03:42:08 +0200 [nxx-0x: fct_tpd_work_thread_0: scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 2c with event status 1 , topology type 1, status1 0x0, status2 0x0.

Wed Apr 12 03:42:09 +0200 [nxx-0x: fct_tpd_work_thread_0: scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 2c.

fcp.io.status:debug]: STIO Adapter:0e AEN 0x8048 (RECV_ERROR) MboxStatus1 0x1002 MboxStatus2 0xa1
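
To see how often a port is flapping, the link events can be tallied per adapter. The following is a minimal sketch that assumes the EMS messages have been saved as plain text, one event per line; the regular expressions are derived only from the two sample messages above, and `count_link_events` is a hypothetical helper name.

```python
import re
from collections import Counter

# The two sample EMS messages from above, one event per line.
EMS_SAMPLE = """\
Wed Apr 12 03:42:08 +0200 [nxx-0x: fct_tpd_work_thread_0: scsitarget.slifct.linkBreak:error]: Link break detected on Fibre Channel target HBA 2c with event status 1 , topology type 1, status1 0x0, status2 0x0.
Wed Apr 12 03:42:09 +0200 [nxx-0x: fct_tpd_work_thread_0: scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 2c.
"""

# Patterns inferred from the sample text only; other ONTAP releases may word these differently.
LINK_BREAK = re.compile(
    r"scsitarget\.slifct\.linkBreak:\w+\]: Link break detected on Fibre Channel target HBA (\w+)"
)
LINK_UP = re.compile(
    r"scsitarget\.hwpfct\.linkUp:\w+\]: Link up on Fibre Channel target adapter (\w+)"
)


def count_link_events(text: str):
    """Return (link_breaks, link_ups) as Counters keyed by adapter name."""
    breaks, ups = Counter(), Counter()
    for line in text.splitlines():
        match = LINK_BREAK.search(line)
        if match:
            breaks[match.group(1)] += 1
            continue
        match = LINK_UP.search(line)
        if match:
            ups[match.group(1)] += 1
    return breaks, ups


if __name__ == "__main__":
    breaks, ups = count_link_events(EMS_SAMPLE)
    for adapter in sorted(set(breaks) | set(ups)):
        print(f"adapter {adapter}: {breaks[adapter]} link break(s), {ups[adapter]} link up(s)")
```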

  • From the sfpshow output on the Brocade switch:

RX Power:   -1.9   dBm (647.5uW) 
TX Power:   -4.6   dBm (349.3 uW)
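
One way to put numbers like these to use is to estimate the optical loss between a transmitter and the far-end receiver. The sketch below first checks that the microwatt values printed by sfpshow round to the printed dBm values, then computes a loss figure. It assumes, purely for illustration, that this switch TX reading and the ONTAP RX reading of 241.3 uW shown earlier belong to the two ends of the same link; the article does not state that they do. The function names are hypothetical.

```python
import math


def uw_to_dbm(microwatts: float) -> float:
    """Convert optical power from microwatts to dBm (0 dBm = 1 mW = 1000 uW)."""
    return 10 * math.log10(microwatts / 1000.0)


def link_loss_db(tx_uw: float, rx_uw: float) -> float:
    """Attenuation in dB between a transmitter and the receiver at the far end."""
    return uw_to_dbm(tx_uw) - uw_to_dbm(rx_uw)


# Values printed by sfpshow above.
SWITCH_RX_UW = 647.5
SWITCH_TX_UW = 349.3

# ASSUMPTION: ONTAP RX reading from the fcp adapter show sample, treated here
# as the far end of the same link for illustration only.
ONTAP_RX_UW = 241.3

if __name__ == "__main__":
    print(f"switch RX: {uw_to_dbm(SWITCH_RX_UW):.1f} dBm")
    print(f"switch TX: {uw_to_dbm(SWITCH_TX_UW):.1f} dBm")
    print(f"switch TX -> ONTAP RX loss: {link_loss_db(SWITCH_TX_UW, ONTAP_RX_UW):.2f} dB")
```

A loss figure is only meaningful when both readings come from the same physical link and are taken at about the same time.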

  • In the `show interface brief` output, the Cisco switch port connected to the affected NetApp port is reported as "errDisabled":

-----------------------------------------------------------------------------------------
Interface  Vsan   Admin  Admin   Status       SFP   Oper  Oper    Port     Logical
                  Mode   Trunk                      Mode  Speed   Channel  Type
                         Mode                             (Gbps)
-----------------------------------------------------------------------------------------
fc1/27     3      auto   off     errDisabled  swl   --    --      --       --

  • The show int transceiver detail output on the Cisco switch reports low Tx power, indicating a problem with the switch-side SFP:

   SFP Diagnostics Information:
----------------------------------------------------------------------------
                               Alarms                    Warnings
                            High         Low          High         Low
----------------------------------------------------------------------------
  Temperature  40.16 C      75.00 C      -5.00 C      70.00 C      0.00 C
  Voltage      3.33 V       3.63 V       2.97 V       3.46 V       3.13 V
  Current      0.00 mA      12.00 mA     4.00 mA      11.20 mA     4.80 mA
  Tx Power     -40.00 dBm   5.00 dBm     -12.20 dBm   2.00 dBm     -8.20 dBm
  Rx Power     -0.49 dBm    5.00 dBm     -15.20 dBm   2.00 dBm     -11.20 dBm
  Transmit Fault Count = 0
----------------------------------------------------------------------------
  Note: ++  high-alarm; +  high-warning; --  low-alarm; -  low-warning
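
The readings above can be checked against their thresholds mechanically. The following is a minimal sketch that reuses the flag notation from the Note line (`++`, `+`, `--`, `-`); the values are copied from the table, while the `Reading` type, the `flag` function, and the choice to treat a value equal to a threshold as a breach are assumptions made for illustration.

```python
from typing import NamedTuple


class Reading(NamedTuple):
    name: str
    value: float
    high_alarm: float
    low_alarm: float
    high_warn: float
    low_warn: float


# Values copied from the SFP diagnostics table above.
READINGS = [
    Reading("Temperature (C)", 40.16, 75.00, -5.00, 70.00, 0.00),
    Reading("Voltage (V)", 3.33, 3.63, 2.97, 3.46, 3.13),
    Reading("Current (mA)", 0.00, 12.00, 4.00, 11.20, 4.80),
    Reading("Tx Power (dBm)", -40.00, 5.00, -12.20, 2.00, -8.20),
    Reading("Rx Power (dBm)", -0.49, 5.00, -15.20, 2.00, -11.20),
]


def flag(reading: Reading) -> str:
    """Return '++', '--', '+', '-' or '' using the notation from the Note line.

    ASSUMPTION: a value exactly on a threshold counts as breaching it.
    """
    if reading.value >= reading.high_alarm:
        return "++"
    if reading.value <= reading.low_alarm:
        return "--"
    if reading.value >= reading.high_warn:
        return "+"
    if reading.value <= reading.low_warn:
        return "-"
    return ""


if __name__ == "__main__":
    for reading in READINGS:
        print(f"{reading.name:<16} {reading.value:>8.2f} {flag(reading)}")
```

With these values, both Tx Power and Current fall below their low-alarm thresholds, which is consistent with an SFP that is not driving its laser.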

  • The Rx and Tx values on the Cisco switch port are optimal, and the port status is up:

fc2/42 is up
[...]
Temperature 38.33 C, Voltage 3.38 V, Current 6.54 mA, TxPower -6.08 dBm, RxPower -3.11 dBm

  • The port on the switch side was toggled (shutdown, then no shutdown), but the Rx power on the NetApp-side FC port remains low:

Tue May 28 23:35:39 2024:type=update:id=10.0.xx.xyz@pts/0:user=admin:cmd=configure terminal ; interface fc2/42 (SUCCESS)
Tue May 28 23:35:57 2024:type=update:id=10.0.xx.xyz@pts/0:user=admin:cmd=Interface fc2/42 state updated to down
Tue May 28 23:35:57 2024:type=update:id=10.0.xx.xyz@pts/0:user=admin:cmd=configure terminal ; interface fc2/42 ; shutdown (SUCCESS)
Tue May 28 23:41:05 2024:type=update:id=10.0.xx.xyz@pts/0:user=admin:cmd=Interface fc2/42 state updated to up
Tue May 28 23:41:05 2024:type=update:id=10.0.xx.xyz@pts/0:user=admin:cmd=configure terminal ; interface fc2/42 ; no shutdown (SUCCESS)

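  • Low throughput is seen on the adapter in System Manager.
  • As part of troubleshooting, the following steps can be performed:
    • Low RX on ONTAP usually indicates a hardware fault on the transmitting side (switch or patch panel).
    • Run a cable test and check the patch panel to confirm that the cable is good.
    • Move the cable to another available, working port on the switch side and check whether the issue persists.
    • Clean the connections on the cable and on both SFPs.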
 

  • VMware servers frequently go into a Not responding state.
  • The following errors are seen on the storage side:
Tue May 20 01:35:58 +0530 [cx-labs-bgl-aff-03: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:1a IO WQE failure, Handle 0x0, Type 8, S_ID: 3002xx, VPI: 3, OX_ID: 4D20, Status 0x3 Ext_Status 0x1d
Tue May 20 01:36:13 +0530 [cx-labs-bgl-aff-03: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:1a IO WQE failure, Handle 0x0, Type 8, S_ID: 3002xx, VPI: 3, OX_ID: 4051, Status 0x3 Ext_Status 0x1d
  • Performance issues are observed on the host side. The following errors are reported on the host:
2025-05-23T06:38:20.431Z cpu85:2098417)ScsiCore: 1823: Power-on Reset occurred on naa.600a098038314a71692454716472xxxx
2025-05-23T06:38:23.249Z cpu24:2098418)ScsiDeviceIO: 4115: Cmd(0x45d9d4a9b6c8) 0x8a, CmdSN 0x800e002e from world 2125982 to dev "naa.600a098038314a71692454716472xxxx" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x6 0x29 0x0
2025-05-23T06:38:41.558Z cpu30:3060886 opID=f36de402)UserDump: 2823: hostd-worker: Userworld(hostd-worker) coredump complete.
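
The status and sense fields in the ScsiDeviceIO line can be decoded with the standard SCSI tables. The sketch below is a minimal decoder that covers only a small subset of codes; the lookup tables follow the SCSI (SAM/SPC) definitions, while the regular expression is inferred from the single sample line above and `decode_scsi_failure` is a hypothetical helper name.

```python
import re

# The ScsiDeviceIO line from the host log above.
HOST_LINE = (
    '2025-05-23T06:38:23.249Z cpu24:2098418)ScsiDeviceIO: 4115: Cmd(0x45d9d4a9b6c8) 0x8a, '
    'CmdSN 0x800e002e from world 2125982 to dev "naa.600a098038314a71692454716472xxxx" '
    'failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x6 0x29 0x0'
)

# Small subsets of the standard SCSI code tables; unknown codes are reported as such.
DEVICE_STATUS = {0x00: "GOOD", 0x02: "CHECK CONDITION", 0x08: "BUSY",
                 0x18: "RESERVATION CONFLICT", 0x28: "TASK SET FULL"}
SENSE_KEYS = {0x0: "NO SENSE", 0x2: "NOT READY", 0x3: "MEDIUM ERROR",
              0x4: "HARDWARE ERROR", 0x5: "ILLEGAL REQUEST",
              0x6: "UNIT ATTENTION", 0xB: "ABORTED COMMAND"}
ASC_ASCQ = {(0x29, 0x00): "POWER ON, RESET, OR BUS DEVICE RESET OCCURRED"}

HEX = r"(0x[0-9a-fA-F]+)"
PATTERN = re.compile(rf"H:{HEX} D:{HEX} P:{HEX} Valid sense data: {HEX} {HEX} {HEX}")


def decode_scsi_failure(line: str) -> dict:
    """Decode host/device/plugin status and sense data from one log line."""
    match = PATTERN.search(line)
    if match is None:
        raise ValueError("no SCSI status fields found in line")
    host, device, plugin, key, asc, ascq = (int(g, 16) for g in match.groups())
    return {
        "host_status": host,
        "plugin_status": plugin,
        "device_status": DEVICE_STATUS.get(device, "unknown"),
        "sense_key": SENSE_KEYS.get(key, "unknown"),
        "additional_sense": ASC_ASCQ.get((asc, ascq), "unknown"),
    }


if __name__ == "__main__":
    for field, value in decode_scsi_failure(HOST_LINE).items():
        print(f"{field}: {value}")
```

For the sample line, the decoded sense data is a unit attention for a reset condition, which matches the "Power-on Reset occurred" message logged just before it.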
