在单个 NIC 端口上收到 CRC 错误
状态信息
适用场景
- ONTAP 9
- FAS/AFF 系统
- 在单个端口上报告 CRC 错误
问题描述
- 事件日志报告物理和/或逻辑端口上的硬件错误。
[node-01: vifmgr: vifmgr.cluscheck.crcerrors]: Port a0b on node node-01 is reporting a high number of observed hardware errors, possibly CRC errors
[node-02: vifmgr: vifmgr.cluscheck.crcerrors]: Port e0d on node node-02 is reporting a high number of observed hardware errors, possibly CRC errors
[node-02: vifmgr: vifmgr.cluscheck.hwerrors:alert]: Port e0d on node node-02 is reporting a high number (at least 1 per 1000 packets) of observed hardware errors (CRC, length, alignment, dropped)
[node-02: vifmgr: callhome.clus.net.degraded:alert]: Call home for CLUSTER NETWORK DEGRADED: CRC Errors Detected - High CRC errors detected on port e0d node node-02
ifstat
如果ONTAP 收到错误、则输出将显示CRC错误。- 重新 安装缆线/SFP后、受影响
ifstat -z
节点上的问题描述仍然存在。
RECEIVE
Total frames: 36418m | Frames/second: 23646 | Total bytes: 179t
Bytes/second: 116m | Total errors: 170k | Errors/minute: 7
Total discards: 0 | Discards/minute: 0 | Multi/broadcast: 1686k
Non-primary u/c: 0 | CRC errors: 159k | Long frames: 0
- 在交换机端口或客户端上可能会出现 CRC 错误,并且可能会因数据包丢失而出现延迟
2022-03-20T17:39:36.443Z cpu36:2098075)WARNING: ScsiDeviceIO: 1498: Device naa.600a09803830574c4d5d53ddf26c4543 performance has deteriorated. I/O latency increased from average value of 18171 microseconds to 1816780 microseconds.
要记住的要点:
许多交换环境都使用直通交换、而不是存储和转发交换、因为其速度较快
- 这意味着故障硬件可能不在直接连接的链路上
- CRC可能发生在上游
- 对于
ifstat
CRC错误、此值在中显示为非零值 - 如果虚电路为零但交换机具有虚电路、则可能会传输问题、但ONTAP未发现错误