由于交换机上的主机SFP出现故障、ESXi主机上出现路径冗余降级警报
适用场景
- ONTAP 9.
- FC
- ESXi
- Brocade SAN交换机
问题描述
多个NetApp LUN出现主机连接问题、并且路径降级、并且NetApp FC LUN会重新断开连接。
- ESXi主机在vCenter上的VMware端报告以下警报-
This email is to notify you that an alarm has been triggered in your vCenter:
[Warning] Alarm alarm.StorageConnectivityAlarm on Host hostabc.xxx.com
because Path redundancy to storage device naa.600a098xxxxxx46c3f515xxxxxxxx degraded. Path vmhba2:C0:xx:xx0 is down. Affected datastores: xxx-NetApp-xyz..
Alarm name alarm.StorageConnectivityAlarm
Description alarm.StorageConnectivityAlarm
Target Host hostabc.xxx.com
Status Warning (previous status: Normal)
Triggered time 04/03/2024 01:27:05 PM
Path redundancy to storage device naa.600a098xxxxxx46c3f515xxxxxxxx degraded. Path vmhba2:C0:T8:L142 is down. Affected datastores: xxx-NetApp-xyz. Warning 04/04/2024, 11:12:40 AM
存储端的LUN处于联机状态并已映射。
FC端口均已启动、 Rx、Tx值处于最佳范围。
频繁出现EMS消息"fcp.io.status:State=5" EMS中报告stio hung cmd事件且状态为5:
Wed Apr 03 13:02:34 +0200 [NetApp: fct_tpd_thread_5: fcp.io.status:debug]: STIO Adapter:0g, found hung cmd:0xfffff808ed70a770(state=5, flags=0x0, ctio_sent=1/1,RecvExAddr=0x1217d0, OX_ID=0x125, RX_ID=0xffff,SID=0x4105xx, Cmd[2A], req_q_free:3501)
Wed Apr 03 14:41:09 +0200 [NetApp: fct_tpd_thread_4: fcp.io.status:debug]: STIO Adapter:0h, found hung cmd:0xfffff808ed1d8b38(state=5, flags=0x0, ctio_sent=1/1,RecvExAddr=0x11d570, OX_ID=0x735, RX_ID=0xffff,SID=0x4105xx, Cmd[2A], req_q_free:1321)
注意: "state=5:DATAOUT等待 - 表示FC目标正在等待在接受写入请求后从主机返回的内容;但是、在预期超时值内未返回任何内容。"
- 这些科学、技术和信息咨询活动来自两个具体的小岛屿发展中国家。
- SAN交换机上的主机连接端口的 状态为Laser_FLT,表示SFP出现故障。
Index Slot Port Address Media Speed State Proto
============================================================
5 1 5 701400 id N16 Laser_Flt FC