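CONTAP-348855: Certain NetApp platforms might encounter intermittent read checksum errors or controller disruptions.
Issue description
In rare cases, ONTAP might serve incorrect user data during the first read of a particular block of data hosted on an NVMe namespace or a NetApp FCP or iSCSI LUN on the A1K, A70, and A90 platforms.
The same defect might cause a controller disruption if ONTAP-based migration is used to move data from a platform with legacy compression enabled to one of the platforms listed above.
Example signatures:
- Windows crashes with a BSOD that references a page-in CRC violation: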
Bug Check: 0x0000001a Parameter1: 0x3f
- The Oracle alert log reports a sequence similar to:
Corrupt block relative dba: 0x0000c4cd (file 42, block 50381) Reread (file 42, block 50381) found valid data Repaired corruption at (file 42, block 50381)
- MongoDB reports the following sequence:
{"t":{"$date":"2025-07-03T13:32:31.752+07:00"},"s":"E", "c":"WT", "id":22435, "ctx":"ReplWriterWorker-54954","msg":"WiredTiger error message","attr":{"error":0,"message":{"ts_sec":1751524351,"ts_usec":751549,"thread":"844:0x7f0038248700","session_dhandle_name":"file:index-288-4360026624752651856.wt","session_name":"WT_CURSOR.remove","category":"WT_VERB_DEFAULT","category_id":9,"verbose_level":"ERROR","verbose_level_id":-3,"msg":"__wt_block_read_off:220:index-288-4360026624752651856.wt: potential hardware corruption, read checksum error for 8192B block at offset 163471360: calculated block checksum doesn't match expected checksum"}}}
- MSSQL logs report similar errors:
A read of the file 'D:\path\file.mdf' at offset 0x0000006d70a000 succeeded after failing 1 time(s) with error: torn page (expected signature: 0x00000000; actual signature: 0x00c03444). Additional messages in the SQL Server error log and operating system error log may provide more detail. This error condition threatens database integrity and must be corrected. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.
Or: A read of the file 'C:\path_to\data.mdf' at offset 0x00000001a22000 succeeded after failing 1 time(s) with error: incorrect checksum (expected: 0x02c078e3; actual: 0x4dab258e). Additional messages in the SQL Server error log and operating system error log may provide more detail. This error condition threatens database integrity and must be corrected. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.
Or: A read of the file 'D:\path_toe\system.mdf' at offset 0x0000006d70e000 succeeded after failing 1 time(s) with error: incorrect pageid (expected 1:224135; actual 27267:-1332691384). Additional messages in the SQL Server error log and operating system error log may provide more detail. This error condition threatens database integrity and must be corrected. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.
Subsequent reads of the same logical block address should succeed, with no errors reported.
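As a rough aid for triage, the signatures listed above can be searched for in collected host or database logs. The following Python sketch is only an illustration: the regular expressions are derived from the example messages in this article, the names `SIGNATURES` and `match_signatures` are hypothetical, and it is not a NetApp-provided tool. It assumes each message is available as a single string; real logs may split a sequence (for example, the Oracle one) across several lines, and a match does not by itself prove that this defect is the cause.

```python
import re

# Patterns derived from the example messages in this article (illustrative only).
SIGNATURES = {
    "windows_bugcheck": re.compile(
        r"Bug Check:\s*0x0000001a\s+Parameter1:\s*0x3f", re.IGNORECASE
    ),
    "oracle_reread": re.compile(
        r"Corrupt block relative dba:.*?Reread.*?found valid data", re.DOTALL
    ),
    "wiredtiger_checksum": re.compile(
        r"__wt_block_read_off:.*?read checksum error", re.DOTALL
    ),
    "mssql_read_retry": re.compile(
        r"succeeded after failing \d+ time\(s\) with error: "
        r"(?:torn page|incorrect checksum|incorrect pageid)"
    ),
}


def match_signatures(text):
    """Return the names of all signature patterns found in the given log text."""
    return [name for name, pattern in SIGNATURES.items() if pattern.search(text)]
```

For example, `match_signatures("Bug Check: 0x0000001a Parameter1: 0x3f")` returns `["windows_bugcheck"]`, while unrelated text returns an empty list.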
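This issue applies to the AFF and ASA variants of the platforms listed above running ONTAP 9.15.1 and later.
Systems running 9.16.1RC1 and later are not affected by this issue.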
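The exposure criteria can be expressed as a small check. The Python sketch below is a minimal illustration, assuming that the two statements above together mean the affected range is 9.15.1 up to, but not including, 9.16.1RC1, and that a model string ends with the platform name (for example "AFF A70" or "ASA A1K"). The function names and the input formats are hypothetical and do not correspond to any ONTAP API.

```python
import re

# Platform names taken from this article.
AFFECTED_PLATFORMS = {"A1K", "A70", "A90"}


def parse_release(version):
    """Return (major, minor, patch) from a string such as '9.15.1P3' or '9.16.1RC1'.

    Suffixes such as 'P3' or 'RC1' are ignored; this is sufficient here because
    9.16.1RC1 is the earliest 9.16.1 build named in the article.
    """
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", version.strip())
    if not match:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.groups())


def is_exposed(model, version):
    """Return True if the model and release fall inside the assumed affected range."""
    tokens = model.upper().split()
    release = parse_release(version)
    return (
        bool(tokens)
        and tokens[-1] in AFFECTED_PLATFORMS
        and (9, 15, 1) <= release < (9, 16, 1)
    )
```

Under these assumptions, `is_exposed("AFF A70", "9.15.1P3")` evaluates to `True`, and `is_exposed("ASA A1K", "9.16.1RC1")` evaluates to `False`.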