Details
Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Hadoop Flags: Reviewed
Description
Calling getBlockLocations on a file that has a PROVIDED replica results in the datanode locations being selected at random. Currently, this randomization uses the datanode UUIDs to pick a node (ProvidedDescriptor#choose, ProvidedDescriptor#chooseRandom). Depending on the distribution of the datanode UUIDs, this can take a large number of iterations, and may never terminate, before a location is chosen. This JIRA aims to replace it with a more efficient randomization strategy.
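As a rough illustration of what a bounded randomization strategy could look like, the sketch below picks a random index into a list of eligible nodes instead of probing by UUID. It is not the patch attached to this issue: the class name RandomNodeChooser, the use of plain strings for datanode identifiers, and the excluded-set parameter are all hypothetical, chosen only to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;
import java.util.Set;

/**
 * Hypothetical sketch: choose a random node in bounded time.
 * Node identifiers are plain strings here; this is an assumption
 * made for illustration, not the HDFS ProvidedDescriptor code.
 */
public class RandomNodeChooser {
    private final List<String> nodes;
    private final Random random;

    public RandomNodeChooser(List<String> nodes, Random random) {
        // Copy defensively so later changes to the caller's list have no effect.
        this.nodes = new ArrayList<>(nodes);
        this.random = random;
    }

    /**
     * Returns a uniformly random node that is not in {@code excluded},
     * or null if every node is excluded. Runs in one pass over the
     * node list, so it always terminates.
     */
    public String chooseRandom(Set<String> excluded) {
        List<String> candidates = new ArrayList<>(nodes.size());
        for (String node : nodes) {
            if (!excluded.contains(node)) {
                candidates.add(node);
            }
        }
        if (candidates.isEmpty()) {
            return null;
        }
        // A single index draw replaces repeated probing of the UUID space.
        return candidates.get(random.nextInt(candidates.size()));
    }
}
```

The cost is linear in the number of nodes per call regardless of how their identifiers are distributed, which trades a small fixed amount of work for a guaranteed upper bound.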
Attachments
Issue Links
- is related to: HDFS-12778 [READ] Report multiple locations for PROVIDED blocks (Resolved)