NVMe 驱动器覆盖 bdata
适用场景
- ONTAP 9
- X4013 驱动器
问题描述
- 反复出现崩溃
PANIC: page fault (supervisor read data, page not present) on VA 0xd8 cs:rip 0x20:0xffffffff8c3b87ab rflags 0x10286 in SK process wafl_exempt2
PANIC: A potential in-memory corruption of file system metadata (public inode 96, level 1, fbn 1704801990, incremental checksum mismatch) has been detected. Rebooting the node to keep the file system on disk unaffected. Contact NetApp technical support for help. in SK process wafl_exempt17
PANIC: wafl_load_buf: wip = 0x0xfffff840be4c8480, fbn = 2251731920, level = 1, last logical block = 84645, last physical block = 84645, flags = 0, in SK process wafl_exempt08
- 导致崩溃的磁盘错误
Sat Apr 10 11:30:27 +0000 [node-01: scsi_cmdblk_strthr_admin: scsi.cmd.checkCondition:error]: Disk device 0n.20L0: Check Condition: CDB 0x28:45d52998:0001: Sense Data SCSI:aborted command - (0xb - 0x90 0x6 0xfa)(11099).
Sat Apr 10 11:30:27 +0000 [node-01: scsi_cmdblk_strthr_admin: scsi.cmd.aborted:error]: Disk device 0n.20L0: Command aborted: cdb 0x28:45d52998:0001 (11099).
Sat Apr 10 11:30:27 +0000 [node-01: disk_server_0: disk.IO.status:debug]: params: {'deviceName': '0n.20', 'returnCode': '9', 'pathRetryCount': '0', 'adapterStatus': '0x0', 'cdb': '0x28:45d52998:0001', 'basicTimeout': '5', 'iASCQ': '0x6', 'iSenseKey': '0xb', 'sSenseCode': '', 'ETime': '11101', 'iASC': '0x90', 'victimRetryCount': '0', 'sSenseKey': 'SCSI:aborted command', 'targetStatus': '0x2', 'disk_information': 'Disk 0n.20 Shelf 0 Bay 20 [NETAPP X4013S172B7T6NTE NA56] S/N [xxxxxxxxxxx] UID [34393030:4D601317:00253859:00000004:00000000:00000000:00000000:00000000:00000000:00000000]', 'retryCount': '0', 'pathsTried': '0', 'timeoutRetryCount': '0'}