SM-BC 关系间歇性不同步
适用于
- ONTAP 9
- SnapMirror 业务连续性 (SM-BC)
问题描述
- 以下事件偶尔会在目标集群上触发:
SMGROS:HA Group Notification from dest_cluster_node (SNAPMIRROR DESTINATION IS OUT OF SYNC MORE THAN 15 MINUTES: dest_iSCSI_SVM:/cg/dest_vol) ALERT
- 在源集群中也可以看到以下事件:
[src_cluster_node: repl_Handle_sync_resl: callhome.syncsm.exception:error]: Call home for SNAPMIRROR SYNCHRONOUS EXCEPTION: Network Latency exceeded threshold limit.
[src_cluster_node: sm_rpl_admin_main: sm.syncmirror.out.of.sync:alert]: Sync granular CG relationship with source path src_iSCSI_SVM:/cg/src_vol and destination path dest_iSCSI_SVM:/cg/dest_vol has transitioned from in-sync to out-of-sync for the follow reason: Replication engine was aborted as part of a consistency group coordinated operation.(Replication engine error).
sm.syncmirror.out.of.sync:alert
紧随其后的是sms.status.in.sync:notice
事件。
- 以下事件可以在 SnapMirror 目标审核日志上看到。
InSyncTransfer[Mar 19 12:15:01]:138f4255-b3b9-11ee-9d67-d039eab0afa9 Operation-Uuid=a2f9e8b2-e5af-11ee-ad6f-d039eab0aa86 Group=CGflexvol Operation-Cookie=0 action=Info Transfer failed.(Replication engine was aborted as part of a consistency group coordinated operation.(Replication engine error(Refer to the corresponding sms.status.out.of.sync EMS generated on the SnapMirror destination cluster for more details on the OutOfSync reason. If the SnapMirror destination does not generate an OutOfSync EMS this could be a transient failure during a resync operation.))).
InSyncTransfer[Mar 19 14:55:15]:138f7463-b3b9-11ee-9d67-d039eab0afa9 Operation-Uuid=a2f9e8b2-e5af-11ee-ad6f-d039eab0aa86 Group=CGflexvol Operation-Cookie=0 action=Info Relationship "1397115a-b3b9-11ee-9d67-d039eab0afa9" is out of sync(Replication engine was aborted as part of a consistency group coordinated operation.(Replication engine error))
- 在源 EMS 日志上可以看到以下事件。
Wed Mar 19 12:12:00 +0800 [src_cluster_node: mgwd: sm.mediator.unreachable:alert]: ONTAP Mediator (IP: 10.10.119.244) is unreachable from cluster <src_cluster>.