Aggregate is offline and a disk is missing in single-node Cloud Volumes ONTAP in AWS
Applies to
- Cloud Volumes ONTAP (CVO)
- AWS
Issue
- The aggregate is shown in the failed state in the AWS Cloud Volumes ONTAP environment:
Your_AWS_node::> aggr show
Aggregate     Size Available Used% State   #Vols  Nodes            RAID Status
--------- -------- --------- ----- ------- ------ ---------------- ------------
aggr0_Your_AWS_node_01
           124.0GB    6.01GB   95% online       1 Your_AWS_node-01 raid0,
                                                                   normal
aggr1           0B        0B    0% failed       0 Your_AWS_node-01 raid0,
                                                                   partial
2 entries were displayed.
- The following messages are logged in EMS:
12/9/2021 15:48:15 Your_AWS_node-01 ALERT sk.panic: Panic String: aggr aggr1: raid volfsm, fatal disk error in RAID group with no parity disk..
Raid type - raid0
Group name plex0/rg0 state NORMAL. 1 disk failed in the group.
Disk 0b.6 S/N [00000000V-xNBTca/C86] UID [00000000V-xNBTca/C86] error: no valid path to disk. in SK process config_thread on
release 9.10.1RC1 (C)
12/9/2021 15:48:15 Your_AWS_node-01 ERROR scsi.cmd.adapterHardwareErrorEMSOnly: Disk device 0b.6L0: Adapter detected hardware error: HA status 0x6: cdb 0x2a:00034c30:0078. Disk 0b.6 S/N [00000000V-xNBTca/C86] UID [00000000V-xNBTca/C86] Target Address [nvme (null)6]
12/9/2021 15:48:15 Your_AWS_node-01 ERROR scsi.cmd.abortedByHost: Disk device 0b.6L0: Command aborted by host adapter: HA status 0x4: cdb 0x2a:00034c30:0078. Disk 0b.6 S/N [00000000V-xNBTca/C86] UID [00000000V-xNBTca/C86] Target Address [nvme (null)6]
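- The disk show output does not list all of the disks that make up the aggregate. In this example, only 5 disks are visible for aggr1, but a previous AutoSupport shows that the aggregate should contain 6 disks: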
Your_AWS_node::> disk show
                     Usable           Disk    Container   Container
Disk                   Size Shelf Bay Type    Type        Name      Owner
---------------- ---------- ----- --- ------- ----------- --------- --------
NET-1.1             137.8GB     -   - VMDISK  aggregate   aggr0_Your_AWS_node_01
                                                                    Your_AWS_node-01
NET-1.3              1007GB     -   - VMDISK  aggregate   aggr1     Your_AWS_node-01
NET-1.4              1007GB     -   - VMDISK  aggregate   aggr1     Your_AWS_node-01
NET-1.5              1007GB     -   - VMDISK  aggregate   aggr1     Your_AWS_node-01
NET-1.7              1007GB     -   - VMDISK  aggregate   aggr1     Your_AWS_node-01
NET-1.8              1007GB     -   - VMDISK  aggregate   aggr1     Your_AWS_node-01
6 entries were displayed.
In this example, comparing the current disk show output with the previous AutoSupport shows that disk NET-1.6 is missing.
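The comparison described above can be scripted when the disk list is long. The following is a minimal sketch, not a NetApp tool: the helper names `disks_in_aggregate` and `missing_disks` are hypothetical, the `NET-x.y` name pattern is assumed from the output in this article, and the expected disk set is reconstructed from the statement that aggr1 should hold 6 disks with NET-1.6 absent. In practice the expected list would be taken from the previous AutoSupport.

```python
import re

# Assumed naming pattern for CVO virtual disks, based on the disk show output in this article.
DISK_NAME = re.compile(r"^NET-\d+\.\d+$")


def disks_in_aggregate(disk_show_output, aggregate):
    """Return the set of disk names whose disk show row names the given aggregate."""
    disks = set()
    for line in disk_show_output.splitlines():
        fields = line.split()
        # A data row starts with a disk name and carries the aggregate name as its own field.
        if len(fields) >= 2 and DISK_NAME.match(fields[0]) and aggregate in fields[1:]:
            disks.add(fields[0])
    return disks


def missing_disks(expected, current):
    """Return, in sorted order, the disks that were expected but are not currently listed."""
    return sorted(set(expected) - set(current))


# Current disk show rows, copied from the example in this article.
CURRENT_DISK_SHOW = """\
NET-1.1 137.8GB - - VMDISK aggregate aggr0_Your_AWS_node_01
Your_AWS_node-01
NET-1.3 1007GB - - VMDISK aggregate aggr1 Your_AWS_node-01
NET-1.4 1007GB - - VMDISK aggregate aggr1 Your_AWS_node-01
NET-1.5 1007GB - - VMDISK aggregate aggr1 Your_AWS_node-01
NET-1.7 1007GB - - VMDISK aggregate aggr1 Your_AWS_node-01
NET-1.8 1007GB - - VMDISK aggregate aggr1 Your_AWS_node-01
"""

# Assumption: the disk set that the previous AutoSupport reported for aggr1.
EXPECTED_AGGR1_DISKS = {
    "NET-1.3", "NET-1.4", "NET-1.5", "NET-1.6", "NET-1.7", "NET-1.8",
}

current = disks_in_aggregate(CURRENT_DISK_SHOW, "aggr1")
missing = missing_disks(EXPECTED_AGGR1_DISKS, current)
print(missing)  # prints ['NET-1.6']
```

Matching the aggregate name as a whole field, rather than as a substring, keeps the root aggregate's disk (NET-1.1) out of the aggr1 result.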