FCP SRAM 转储和 IO WQE 故障,Ext_Status 0x16 由同一主机 SID 引起
适用于
- ONTAP 9.x
- FC
问题描述
EMS记录报告IO WQE failure使用Ext_Status 0x16.
- 扩展状态 0x16 进一步表明 Host initiator 已发送中止以清除当前命令队列。
Sun May 14 23:05:37 +0530 [NetApp: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:2b IO WQE failure, Handle 0x1, Type 8, S_ID: 61xx0, VPI: 276, OX_ID: 1B28, Status 0x3 Ext_Status 0x16
Sun May 14 23:05:50 +0530 [NetApp: fct_tpd_work_thread: ems.engine.suppressed:debug]: Event 'fcp.io.status' suppressed 1 times in last 5 seconds.
Sun May 14 23:05:50 +0530 [NetApp: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:3b IO WQE failure, Handle 0x3, Type 8, S_ID: 61xx0, VPI: 276, OX_ID: 2588, Status 0x3 Ext_Status 0x16
Sun May 14 23:06:05 +0530 [NetApp: fct_tpd_work_thread: ems.engine.suppressed:debug]: Event 'fcp.io.status' suppressed 2 times in last 15 seconds.
EMS登录存储报告和正在FCP SRAMP Dump a重置的适配器:
Sun Dec 03 17:08:54 0000 [NetApp: fct_tpd_work_thread_0: fcp.io.status:debug]: STIO Adapter:4b IO WQE failure, Handle 0x1, Type 8, S_ID: 0x61xx0, VPI: 275, OX_ID: 5FBB, Status 0x3 Ext_Status 0x16
Sun Dec 03 17:09:16 0000 [NetApp: fct_tpd_thread_1: scsitarget.fcp.dump:debug]: FCP target SRAM dump generated for adapter 4b, FW Initiated Dump
Sun Dec 03 17:09:16 0000 [NetApp: fct_tpd_thread_1: scsitarget.hwpfct.errorReset:notice]: An error was encountered in the FC target driver on Fibre Channel target adapter 4b. The adapter will be automatically reset to clear the status:0x87800000, status1:0x52004c62, status2:0x610102, DIP:1, RN:1, RDY:1, Dump owner:1 condition.
Sun Dec 03 17:09:16 0000 [NetApp: fct_tpd_thread_0: scsitarget.hwpfct.errorReset:notice]: An error was encountered in the FC target driver on Fibre Channel target adapter 4a. The adapter will be automatically reset to clear the status:0x87800000, status1:0x52004c62, status2:0x610102, DIP:1, RN:1, RDY:1, Dump owner:1 condition.
Sun Dec 03 17:09:16 0000 [NetApp: fct_tpd_thread_0: fcp.io.status:debug]: STIO Adapter 4a resetting with 28 ITNs and 10 commands to drain
Sun Dec 03 17:09:16 0000 [NetApp: fct_tpd_thread_0: scsitarget.fct.reset:notice]: Resetting Fibre Channel target adapter 4a.
Sun Dec 03 17:09:17 0000 [NetApp: fct_tpd_thread_1: scsitarget.hwpfct.dump.saved:notice]: A dump for adapter 4b was stored in /etc/log/fctsli_4b_20231203_170916/fct_fw_4b.dmp.gz.
Sun Dec 03 17:09:17 0000 [NetApp: fct_tpd_thread_1: callhome.fcp.sram.dump:error]: Call home for FCP SRAM DUMP.
Sun Dec 03 17:09:17 0000 [NetApp: fct_tpd_thread_1: scsitarget.fct.reset:notice]: Resetting Fibre Channel target adapter 4b.
Dec 03 17:09:18 0000 [NetApp: fct_tpd_work_thread_0: scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 4a.
Sun Dec 03 17:09:18 0000 [NetApp: fct_tpd_work_thread_0: scsitarget.hwpfct.linkUp:notice]: Link up on Fibre Channel target adapter 4b.
- STIO 挂起 cmd 事件与
state=10和state =5在EMS中报告了相同的SID 0x61xx0
Sun May 14 23:13:42 +0530 [NetApp: fct_tpd_thread_1: fcp.io.status:debug]: STIO Adapter:2b, found hung cmd:0xfffff8188234ab78(state=10, flags=0x2, ctio_sent=2/3,RecvExAddr=0x2ae5, OX_ID=0x1e66, RX_ID=0xffff,SID=0x61xx0, Cmd[28], req_q_free:0)
Wed Dec 20 21:36:21 +0000 [NetApp: fct_tpd_thread_3: fcp.io.status:debug]: STIO Adapter:4d, found hung cmd:0xfffff81e0a7d1038(state=5, flags=0x0, ctio_sent=1/5,RecvExAddr=0x271f, OX_ID=0x5ee9, RX_ID=0xffff,SID=0x61xx0, Cmd[2A], req_q_free:0)
- 注意:state=5: DATAOUT_WAIT 这表示 FC 目标在接受写入请求后正在等待主机返回某些内容;但是,在预期的超时值内没有返回任何内容。
- Rx 和 Tx 功率在存储端口上处于最佳范围内
- 检查主机和交换机之间的任何物理层问题,如 sfp、电缆、配线面板。