Health Monitor process nchm: StorageFCAdapterFault_Alert

Applies to

  • Fabric-attached MetroCluster
  • ONTAP 9

Issue

  • EMS reports Health Monitor process nchm: StorageFCAdapterFault_Alert

Sun May 15 02:27:00 HKT [nodeA: mgwd: callhome.hm.alert.major:alert]: Call home for Health Monitor process nchm: StorageFCAdapterFault_Alert[100000109b42235f].

  • EMS logs timeout errors when accessing disks.

Sun May 15 02:09:19 HKT [nodeA: slifc_timeout_2: fci.device.quiesce:debug]: Adapter 2d encountered a command timeout on Disk device T1_Brocade6505B:9.126 (0x02080900) LUN 62 cdb 0x9a:000000002d562200:0001:0200 retry: 0 Quiescing the device.
Sun May 15 02:09:20 HKT [nodeA: slifc_timeout_2: fci.device.timeout:debug]: HBA 2d encountered a device timeout on Disk device T1_Brocade6505B:9.126 (0x02080900) LUN 62 cdb 0x9a:000000002d562200:0001:0200 retry: 0
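
When many of these events are logged, it can help to group them by adapter and device to see whether the timeouts concentrate on a single path. The following is a minimal sketch, assuming the message text follows the two EMS samples above. The regular expression and the `timeouts_by_adapter` helper are illustrative names, not part of ONTAP.

```python
import re
from collections import Counter

# Message portions copied from the two EMS events above (shortened after the LUN).
EMS_LINES = [
    "Adapter 2d encountered a command timeout on Disk device "
    "T1_Brocade6505B:9.126 (0x02080900) LUN 62",
    "HBA 2d encountered a device timeout on Disk device "
    "T1_Brocade6505B:9.126 (0x02080900) LUN 62",
]

# Assumed pattern: "<Adapter|HBA> <name> encountered a <command|device> timeout
# on Disk device <switch:port.id>".
TIMEOUT_RE = re.compile(
    r"(?:Adapter|HBA) (\w+) encountered a (?:command|device) timeout "
    r"on Disk device (\S+)"
)

def timeouts_by_adapter(lines):
    """Count timeout events per (adapter, device) pair."""
    counts = Counter()
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            counts[(match.group(1), match.group(2))] += 1
    return counts

if __name__ == "__main__":
    for (adapter, device), count in timeouts_by_adapter(EMS_LINES).items():
        print(f"adapter {adapter} -> {device}: {count} timeout event(s)")
```

In this sample both events point at adapter 2d and the same device behind T1_Brocade6505B.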

  • A large number of transport errors are reported on the affected port.

hard_reset_count                29
Manual adapter dump count 0
Auto adapter dump count 0
firmware_fault_count            0
firmware_pause_count            0
device status:           60900  80900
  link_fail_count             0      0    total:  0
  lip_count                   0      0    total:  0
  underrun_count              0      0    total:  0
  overrun_count               0      0    total:  0
  transport_error_count    2085   1407    total:  3492
  crc_error_count             0      0    total:  0
  victim_abort_io_count      24     23    total:  47
  timeout_io_count           36      1    total:  37
  logged_out_count           11      6    total:  17
  dma_error_count             0      0    total:  0
  resource_unavail_count      0      0    total:  0
  data_reassembly_count       0      0    total:  0
  device_quiesce_count      216    218    total:  434
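
To pick out the counters that matter from a long statistics dump, the per-device lines can be filtered for non-zero totals. This is a minimal sketch, assuming each counter line has the layout shown above (name, one value per device, then `total:`). The pattern and the `nonzero_counters` helper are hypothetical.

```python
import re

# Sample lines copied from the adapter statistics above.
SAMPLE = """\
  link_fail_count             0      0    total:  0
  transport_error_count    2085   1407    total:  3492
  crc_error_count             0      0    total:  0
  timeout_io_count           36      1    total:  37
  device_quiesce_count      216    218    total:  434
"""

# Assumed layout: counter name, per-device values, then "total: N".
LINE_RE = re.compile(r"^\s*(\w+_count)\s+((?:\d+\s+)+)total:\s*(\d+)\s*$")

def nonzero_counters(text):
    """Return {counter_name: total} for counters whose total is above zero."""
    result = {}
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match:
            total = int(match.group(3))
            if total > 0:
                result[match.group(1)] = total
    return result

if __name__ == "__main__":
    for name, total in nonzero_counters(SAMPLE).items():
        print(f"{name}: {total}")
```

On the sample, only transport_error_count, timeout_io_count and device_quiesce_count are reported.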

  • porterrshow shows a large number of crc err and disc c3 errors

                  frames      enc    crc    crc    too    too    bad    enc   disc   link   loss   loss   frjt   fbsy  c3timeout    pcs    uncor
       tx     rx      in    err    g_eof  shrt   long   eof     out   c3    fail    sync   sig                  tx    rx     err    err
  0:    2.0g   3.1g   0      0      0      0      0      0      0     26      0      0      0      0      0      0      0      0      0
  1:    0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  2:    4.2g   4.0g   0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  3:    1.8g   3.7g   0     43.0k  42.3k   0     28    738      0     43.8k  34      0      1.3k   0      0     43.7k   1     46.9m   0
  4:    4.1g   1.7g   0      0      0      0      0      0      0      2      0      0      0      0      0      0      0      0      0
  5:    0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  6:    2.6g   4.0g   0      0      0      0      0      0      0      4      0      0      0      0      0      0      0      0      0
  7:    4.2g   4.1g   0      0      0      0      0      0      0     28      0      0      0      0      0      0      0      0      0
  8:    1.2g   2.2g   0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0      0
  9:    1.8g   2.5g   0      0      0      0      0      0      0     24.5k   2      0      2      0      0      0     24.5k   0      0
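
With many ports, the rows that stand out can be found programmatically. This is a minimal sketch, assuming the column order of the porterrshow header above and that the `k`, `m` and `g` suffixes mean thousands, millions and billions. The column names, the helpers and the threshold of 1000 are illustrative choices, not Brocade-defined values.

```python
# Column order taken from the porterrshow header above (names are our own).
COLUMNS = [
    "frames_tx", "frames_rx", "enc_in", "crc_err", "crc_g_eof", "too_shrt",
    "too_long", "bad_eof", "enc_out", "disc_c3", "link_fail", "loss_sync",
    "loss_sig", "frjt", "fbsy", "c3timeout_tx", "c3timeout_rx", "pcs_err",
    "uncor_err",
]

# Assumption: k = 10^3, m = 10^6, g = 10^9.
SUFFIX = {"k": 1e3, "m": 1e6, "g": 1e9}

# Rows for ports 0, 3 and 9 copied from the output above.
ROWS = [
    "0: 2.0g 3.1g 0 0 0 0 0 0 0 26 0 0 0 0 0 0 0 0 0",
    "3: 1.8g 3.7g 0 43.0k 42.3k 0 28 738 0 43.8k 34 0 1.3k 0 0 43.7k 1 46.9m 0",
    "9: 1.8g 2.5g 0 0 0 0 0 0 0 24.5k 2 0 2 0 0 0 24.5k 0 0",
]

def to_number(token):
    """Convert tokens such as '43.0k' or '26' to a float."""
    if token[-1] in SUFFIX:
        return float(token[:-1]) * SUFFIX[token[-1]]
    return float(token)

def parse_row(line):
    """Split one porterrshow row into (port, {column: value})."""
    port, *values = line.split()
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} counters, got {len(values)}")
    counters = dict(zip(COLUMNS, (to_number(v) for v in values)))
    return int(port.rstrip(":")), counters

def suspect_ports(rows, threshold=1000):
    """Return ports whose crc_err or disc_c3 counter exceeds the threshold."""
    flagged = []
    for line in rows:
        port, counters = parse_row(line)
        if counters["crc_err"] > threshold or counters["disc_c3"] > threshold:
            flagged.append(port)
    return flagged

if __name__ == "__main__":
    print(suspect_ports(ROWS))
```

With these three rows, ports 3 and 9 are flagged: port 3 for both crc err and disc c3, port 9 for disc c3 only.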

  • Collect SFP statistics from the switch connected to the affected adapter port, and verify that the Tx/Rx power is normal.

> sfpshow 6
Current:    0.000   mAmps
Voltage:    3374.8  mVolts
RX Power:   -2.3   dBm (591.5uW)
TX Power:   -inf   dBm (0.0   uW)
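
sfpshow prints optical power both in dBm and in microwatts. The two are related by dBm = 10 · log10(P / 1 mW), so a reading of 0.0 uW corresponds to -inf dBm. The following is a minimal sketch of that conversion. The function names are illustrative, and what counts as a normal power level depends on the SFP specification, which is not covered here.

```python
import math

def uw_to_dbm(microwatts):
    """Convert optical power in microwatts to dBm (0 uW maps to -inf)."""
    if microwatts <= 0:
        return float("-inf")
    return 10 * math.log10(microwatts / 1000.0)

def dbm_to_uw(dbm):
    """Convert optical power in dBm to microwatts."""
    return 1000.0 * 10 ** (dbm / 10.0)

if __name__ == "__main__":
    # Values taken from the sfpshow output above.
    print("RX:", round(uw_to_dbm(591.5), 1), "dBm")
    print("TX:", uw_to_dbm(0.0), "dBm")
```

The RX value of 591.5 uW rounds to the -2.3 dBm shown in the output, while the TX value of 0.0 uW yields -inf dBm, meaning the switch measures no transmitted optical power on that SFP.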

 
