FlexCache卷已满、NFS客户端请求挂起
适用场景
- ONTAP 9.5及更高版本
- FlexCache
- NFS
问题描述
- FlexCache卷已满99%、并导致客户端请求挂起。
- NFS客户端上的"ls"和"cd"等操作已挂起。
- 初始卷未满、并在EMS日志中注意到以下错误:
Wed Mar 29 11:27:54 -0700 [nodeA: wafl_exempt10: wafl.vol.full:alert]: Insufficient space on volume vol__0001@vserver:a385f57a-afbd-11ed-91c0-00a098ba0334 to perform operation. 432KB was requested but only 384KB was available.Wed Mar 29 11:27:55 -0700 [nodeA: wafl_spcd_main: monitor.volume.full:debug]: Volume vol__0001@vserver:a385f57a-afbd-11ed-91c0-00a098ba0334 is full (using or reserving 99% of space and 7% of inodes).Wed Mar 29 11:27:56 -0700 [nodeA: FgGroupListTimer: fg.space.member.full:alert]: Constituent 1099 in FlexGroup vol (fg-uuid b5a85457-b48e-11ed-948e-00a098dec0b4) is out of space.Wed Mar 29 11:37:54 -0700 [nodeA: wafl_exempt13: wafl.vol.full:alert]: Insufficient space on volume vol__0001@vserver:a385f57a-afbd-11ed-91c0-00a098ba0334 to perform operation. 424KB was requested but only 380KB was available.- NFS操作失败:
Wed Mar 29 11:28:12 -0700 [nodeA: kernel: Nblade.dBladeNoResponse.NFS:error]: File operation timed out because there was no response from the data-serving node. Node UUID: 858edac4-7bd1-11ed-a6ec-00a098dec0b4, file operation protocol: NFS, client IP address: 10.1.2.3, RPC procedure: 17.- 在问题描述时间Sktrace显示如下:
2023-03-29T18:27:55Z 14646667110509780 [13:0] WAFLREMOTE_EXCEPTION: store cache 1089.4389 of origin 2156655294.1853 snapid 0: debt enospc (error 292)2023-03-29T18:27:55Z 14646667110610864 [13:0] WAFLREMOTE_EXCEPTION: store cache 1089.4389 of origin 2156655294.1850 snapid 0: debt enospc (error 292)- 理想情况下、FlexCache卷应运行擦除作业、并 在卷 已满90%时清除数据。但在这种情况下、卷已达到99%。