由于磁盘接收到有毒事务层数据包(PTLP)而导致接管
适用场景
- ONTAP 9
- 采用内部存储的AF/FAS系统
问题描述
- 由于磁盘接收到有毒事务层数据包(PTLP)而导致的接管:
PANIC: Uncorrectable Machine Check Error at CPU1. SKL_IIO Error: STATUS<0xf780000000010405>(VALID,OVERFLOW,UC,EN,ADDRV,PCC,S,AR,CORR_ERR_STATUS(0),CORR_ERR_CNT(0),MSCOD(0x1),MCACOD(0x405))IIO Machine Check from device(s):RPT(22,0,0):ErrSrcID(CorrSrc(0x1898),UCorrSrc(0)), PLX PCIE 9797 switch on Controller, PLX PCIE 9797 switch on Controller. , ADDR(0). in process idle: cpu1 on release 9.12.1P5 (C) on Tue Feb 13 10:00:03 PST 2024
- 在分析系统日志时、可以找到以下内容。这指向特定驱动器(根据以下输出14个):
"Poisoned Tranasaction Layer Packet (PTLP)"
This indicates an error on the link between these devices:
Dv[a824](27,0,0) in slot 14: PCI Device 144d:a824 in slot 14 on Controller
Br[9797](24,2,0): PLX PCIE 9797 switch on Controller
- 在EMS中、此驱动器报告了错误:
Tue Feb 13 10:00:07 -0800 [cluster: scsi_cmdblk_strthr_admin: disk.timeout.flush.start:debug]: Aggressive timeout flush started on disk 0n.14.