[HDFS-15574] Remove unnecessary sort of block list in DirectoryScanner - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 3.4.0
Fix Version/s: 3.2.2, 3.3.1, 3.4.0, 3.2.3
Component/s: datanode
Labels:
None

Target Version/s:

3.4.0
Hadoop Flags:

Reviewed

Description

These lines of code in DirectoryScanner#scan(), obtain a snapshot of the finalized blocks from memory, and then sort them, under the DN lock. However the blocks are stored in a sorted structure (FoldedTreeSet) and hence the sort should be unnecessary.

  final List<ReplicaInfo> bl = dataset.getFinalizedBlocks(bpid);
  Collections.sort(bl); // Sort based on blockId

This Jira removes the sort, and renames the getFinalizedBlocks to getSortedFinalizedBlocks to make the intent of the method more clear.

Also added a test, just in case the underlying block structure is ever changed to something unsorted.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-15574.branch-3.3.002.patch
16/Sep/20 13:30
9 kB
Stephen O'Donnell
HDFS-15574.branch-3.3.001.patch
15/Sep/20 07:37
9 kB
Stephen O'Donnell
HDFS-15574.branch-3.2.002.patch
15/Sep/20 17:15
10 kB
Stephen O'Donnell
HDFS-15574.branch-3.2.001.patch
15/Sep/20 07:56
9 kB
Stephen O'Donnell
HDFS-15574.003.patch
13/Sep/20 07:27
9 kB
Stephen O'Donnell
HDFS-15574.002.patch
11/Sep/20 15:28
8 kB
Stephen O'Donnell
HDFS-15574.001.patch
11/Sep/20 10:13
8 kB
Stephen O'Donnell

Issue Links

relates to

HDFS-13671 Namenode deletes large dir slowly caused by FoldedTreeSet#removeAndGet

Resolved

Activity

People

Assignee:: Stephen O'Donnell

Reporter:: Stephen O'Donnell

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 11/Sep/20 09:46

Updated:: 27/Jan/24 03:09

Resolved:: 17/Sep/20 04:51