Details
- Improvement
- Status: Resolved
- Major
- Resolution: Fixed
- 2.7.1
- Reviewed
Description
The new 2-level directory layout can make directory scans expensive in terms of disk seeks (see HDFS-8791 for details).
It would be good if the DirectoryScanner had a configurable duty cycle that would reduce its impact on disk performance (much like the approach in HDFS-8617).
Without such a throttle, disks can go 100% busy for many minutes at a time. Assuming the common case of all inodes in cache but no directory blocks cached, a full directory listing requires 64K seeks, which translates to about 655 seconds (roughly 11 minutes, implying about 10 ms per seek).
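To make the duty-cycle idea concrete, here is a minimal sketch of one way such a throttle might look. It is not the DirectoryScanner implementation or the HDFS-8617 code: the class `DutyCycleThrottle`, its methods, and the parameters `runMs` and `periodMs` are all hypothetical names chosen for illustration. The assumption is that the scanner calls `maybePause()` between units of work (for example, after listing each directory), works for at most `runMs` out of every `periodMs`, and sleeps for the rest of the window.

```java
import java.util.concurrent.TimeUnit;

/** Hypothetical duty-cycle throttle; an illustration, not actual HDFS code. */
final class DutyCycleThrottle {
    private final long runMs;     // work budget per window (assumed parameter)
    private final long periodMs;  // window length (assumed parameter)
    private long windowStartNanos = System.nanoTime();

    DutyCycleThrottle(long runMs, long periodMs) {
        if (runMs <= 0 || periodMs < runMs) {
            throw new IllegalArgumentException("need 0 < runMs <= periodMs");
        }
        this.runMs = runMs;
        this.periodMs = periodMs;
    }

    /**
     * Milliseconds to sleep given the time already spent in the current window.
     * Returns 0 while the work budget is unused, or when the window has
     * already been overrun by a slow unit of work.
     */
    static long sleepMillis(long elapsedMs, long runMs, long periodMs) {
        if (elapsedMs < runMs) {
            return 0;
        }
        return Math.max(0, periodMs - elapsedMs);
    }

    /** Call between units of work; sleeps out the rest of the window if the budget is spent. */
    void maybePause() throws InterruptedException {
        long elapsedMs =
            TimeUnit.NANOSECONDS.toMillis(System.nanoTime() - windowStartNanos);
        if (elapsedMs >= runMs) {
            long sleep = sleepMillis(elapsedMs, runMs, periodMs);
            if (sleep > 0) {
                Thread.sleep(sleep);
            }
            windowStartNanos = System.nanoTime(); // start a fresh window
        }
    }
}
```

With `new DutyCycleThrottle(250, 1000)`, for instance, the scan would occupy the disk for roughly a quarter of each second. The throttle only takes effect between calls, so a single slow directory listing can still exceed the budget for its window.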
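The estimate above can be reproduced with a short calculation. The sketch below assumes that "64K" means 65,536 seeks and that each seek costs about 10 ms; the seek time is not stated in the description and is inferred from the 655-second figure. The class and method names are hypothetical. The second method shows the trade-off a duty cycle introduces: the scan takes longer in wall-clock time, but the disk is busy with it for only a fraction of that time.

```java
/** Hypothetical helper for estimating directory scan time; illustration only. */
final class ScanSeekEstimate {
    /** Seconds of disk-busy time for an unthrottled scan. */
    static double fullScanSeconds(long seeks, double seekMs) {
        return seeks * seekMs / 1000.0;
    }

    /**
     * Wall-clock seconds for the same scan when it may use only a fraction
     * (dutyCycle, in the range (0, 1]) of the disk's time.
     */
    static double throttledScanSeconds(long seeks, double seekMs, double dutyCycle) {
        if (dutyCycle <= 0.0 || dutyCycle > 1.0) {
            throw new IllegalArgumentException("dutyCycle must be in (0, 1]");
        }
        return fullScanSeconds(seeks, seekMs) / dutyCycle;
    }
}
```

Under these assumptions, `fullScanSeconds(65536, 10.0)` gives 655.36 seconds, matching the figure in the description, and a 25% duty cycle stretches the same scan to about 2,621 seconds of wall-clock time.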
Attachments
Issue Links
- breaks
  - HDFS-12319 DirectoryScanner will throw IllegalStateException when Multiple BP's are present (Resolved)
  - HDFS-9174 Fix findbugs warnings in FSOutputSummer.tracer and DirectoryScanner$ReportCompiler.currentThread (Resolved)
  - HDFS-11707 TestDirectoryScanner#testThrottling fails on OSX (Resolved)
- is duplicated by
  - HDFS-8989 Allow rate-limiting the DirectoryScanner via time-slicing (Resolved)
- is required by
  - HDFS-9176 TestDirectoryScanner#testThrottling often fails. (Resolved)
- relates to
  - HDFS-8791 block ID-based DN storage layout can be very slow for datanode on ext4 (Resolved)