Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
1.7.1, 2.4.6
None
Description
The LogCleaner and HFileCleaner currently share a single DirScanPool (the thread pool that scans the OldLogs and archive directories for files to delete). As a result, if the archive directory is large or contains many files and subdirectories, the pool's threads can get stuck scanning the archive directory, starving the LogCleaner. This is especially apparent on S3, where listing can be slower than on HDFS.
This JIRA creates separate DirScanPools for the LogCleaner and the HFileCleaner.
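
The starvation described above can be reproduced with plain java.util.concurrent executors. The following is only a minimal sketch, not the HBase implementation: the class SeparateScanPoolsSketch, the method logScanFinishes, the pool size of 2, and the 500 ms timeout are all hypothetical and chosen for illustration. A latch stands in for a slow archive listing (for example on S3), and an empty task stands in for a LogCleaner scan.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical illustration only; none of these names come from HBase.
class SeparateScanPoolsSketch {

  // Returns true if a "log scan" task completes while the "archive scan"
  // tasks are still blocked on a slow listing.
  static boolean logScanFinishes(boolean separatePools) throws Exception {
    int threads = 2; // assumed pool size, for illustration
    ExecutorService hfilePool = Executors.newFixedThreadPool(threads);
    // With a shared pool, both cleaners submit to the same executor.
    ExecutorService logPool =
        separatePools ? Executors.newFixedThreadPool(threads) : hfilePool;
    CountDownLatch slowListing = new CountDownLatch(1);
    try {
      // Occupy every thread of the HFile pool with a blocked archive scan.
      for (int i = 0; i < threads; i++) {
        hfilePool.submit(() -> {
          try {
            slowListing.await(); // stands in for a slow directory listing
          } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
          }
        });
      }
      // The log scan itself is trivial; it only needs a free thread.
      Future<?> logScan = logPool.submit(() -> { });
      try {
        logScan.get(500, TimeUnit.MILLISECONDS);
        return true;
      } catch (TimeoutException e) {
        return false; // starved: no thread was available in time
      }
    } finally {
      slowListing.countDown(); // release the blocked archive scans
      hfilePool.shutdown();
      if (logPool != hfilePool) {
        logPool.shutdown();
      }
    }
  }

  public static void main(String[] args) throws Exception {
    System.out.println("shared pool, log scan finished: " + logScanFinishes(false));
    System.out.println("separate pools, log scan finished: " + logScanFinishes(true));
  }
}
```

In this sketch the shared-pool run should time out, because both threads are held by the blocked archive scans, while the separate-pool run should complete, because the log scan has its own threads. The trade-off is that two pools use more threads in total than one shared pool of the same size.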