由于数据存储库变为"写保护"、Solaris主机上报告IO错误
适用场景
- ONTAP 9.
- Solaris
问题描述
IO Error
已 在Solaris主机上报告。在
dmesg.out
日志中、Host End-
Jun 21 17:19:30 T5ROOT1 mac: [ID 736570 kern.notice] NOTICE: ldoms-vsw0.vport12 unregistered
Jun 21 17:19:30 T5ROOT1 mac: [ID 469746 kern.notice] NOTICE: ldoms-vsw0.vport12 registered
Jun 25 11:43:13 T5ROOT1 scsi: [ID 583741 kern.warning] WARNING: /scsi_vhci (scsi_vhci0):
Jun 25 11:43:13 T5ROOT1 /scsi_vhci/ssd@g600a0980383057506724535a6832522d (ssd1066): Command Timeout on path fp7/ssd@w2002d039ea487302,95: b397bbc4c4703801◄▬▬▬▬
Jun 25 11:43:32 T5ROOT1 scsi: [ID 583741 kern.warning] WARNING: /scsi_vhci (scsi_vhci0):
Jun 25 11:43:32 T5ROOT1 /scsi_vhci/ssd@g600a0980383057506724535a6832524b (ssd1074): Command Timeout on path fp7/ssd@w2002d039ea487302,99: b3dd046fb2503c01 ◄▬▬▬▬
Jun 25 11:44:28 T5ROOT1 scsi: [ID 583741 kern.warning] WARNING: /scsi_vhci (scsi_vhci0):
- 在以上输出中,的最后一个字段
scsi log
(示例b397bbc4c4703801
)是IO Error Numeric Association (ENA)
特定IO事务的。 - 此 ENA 可用于跟踪 主机端FMA中的IO事务
- 重新尝试写入操作、尝试几次后写入操作会成功、在重新尝试IOS期间、主机会生成IO 错误。
Messages
登录以下主机结束报告:
Oraclet $ TZ=Asia/Kolkata fmdump -VA -E b3dd046fb2503cxx | egrep 'Jun|device-path|asses|key =|asc'
Jun 25 2024 11:43:32.514791264 ireport.io.scsi.cmd.disk.tran
device-path = /pci@380/pci@1/pci@0/pci@7/xxx@0/fp@0,0/ssd@w2002d039ea4873xx,99
driver-assessment = retry
Jun 25 2024 11:43:32.515045492 ereport.io.scsi.cmd.disk.dev.rqs.derr
device-path = /pci@340/pci@1/pci@0/pci@5/xxx@0/fp@0,0/ssd@w2003d039ea4873xx,99
driver-assessment = fault
key = 0x7◄▬▬▬▬DATA PROTECT. !!
asc = 0x27◄▬▬▬▬asc/ascq"27h/01h DZT RO BK HARDWARE WRITE PROTECTED"
ascq = 0x1
devid = id1,ssd@n600a0980383057506724535a6832xxxx
driver-assessment = fault◄▬▬▬▬
filter-ratio = 1
op-code = 0xa
cdb = 0xa 0x2 0x74 0xc1 0x8 0x0
pkt-reason = CMD_CMPLT
pkt-phci-reason = CMD_CMPLT
pkt-state = 0x3f
pkt-stats = 0x0
pkt-flags = 0x2420c000
pkt-time = 60
pkt-hrt-dev = 150307
pkt-hrt-hba = 0
pkt-tag = 0
stat-code = 0x2
key = 0x7◄▬▬▬▬DATA PROTECT. !!
asc = 0x27◄▬▬▬▬ asc/ascq "27h/01h DZT RO BK HARDWARE WRITE PROTECTED" https://www.t10.org/lists/asc-num.htm#ASC_27
ascq = 0x1
sense-data = 0x70 0x0 0x7 0x0 0x0 0x0 0x0 0xe 0x0 0x0 0x0 0x0 0x27 0x1 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
skaarssa = 0x2072701003f00xx
info = ssd_sense_key_fail_command◄▬▬▬▬
__ttl = 0x1
__tod = 0x667a5ff9 0x3698df7c
__hrt = 2633577238186035
- 存储端的卷的 状态显示 为
RW
。 EMS
在问题描述期间、中未报告任何错误事件。- 受影响的卷 属于Snap镜像目标。
- 这些SM关系是在看到问题描述之前完全重新同步的。
Tue Jun 25 11:42:49 IST 2024 ResyncTransfer[Jun 25 11:42:16]:050c5b35-2c9d-11ef-9083-d039ea4873xx Operation-Uuid=dcfe3471-32b9-11ef-9083-d039ea4873xx Group=none Operation-Cookie=0 action=End source=SOURCE_SVM:DISK1 destination=DEST-SVM:DISK1_PSDB status=Success bytes_transferred=6503716443 network_compression_ratio=1.0:1 transfer_desc=Logical Transfer with Storage Efficiency - Optimized Directory Mode
之后、这些关系被中断、以使目标卷变为RW。
Tue Jun 25 11:46:55 IST 2024 BreakVolume[Jun 25 11:46:52]:050c5b35-2c9d-11ef-9083-d039ea4873xx Operation-Uuid=837089b8-32ba-11ef-9083-d039ea4873xx Group=none Operation-Cookie=0 action=End source=SOURCE_SVM:DISK1 destination=DEST-SVM:DISK1_PSDB status=Success