Hadoop HDFS / HDFS-16517

In 2.10 the distance metric is wrong for non-DN machines


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.10.1
    • Fix Version/s: 2.10.2
    • None

    Description

      In 2.10, the metric for the distance between the client and a DataNode is wrong when the client machine is not running a DataNode (i.e., in the getWeightUsingNetworkLocation code path). The code works correctly in 3.3+.

      Currently

       

      Client             DataNode           getWeight  getWeightUsingNetworkLocation
      /rack1/node1       /rack1/node1       0          0
      /rack1/node1       /rack1/node2       2          2
      /rack1/node1       /rack2/node2       4          2
      /pod1/rack1/node1  /pod1/rack1/node2  2          2
      /pod1/rack1/node1  /pod1/rack2/node2  4          2
      /pod1/rack1/node1  /pod2/rack2/node2  6          4

       

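      This bug destroys data locality on clusters where clients share racks with DataNodes but run on machines that do not host a DataNode, for example when federated HDFS clusters are striped across racks. Because getWeightUsingNetworkLocation returns 2 for both a same-rack and an off-rack DataNode, such a client cannot prefer the replica in its own rack.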
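
      As a rough illustration of the values in the getWeight column, the sketch below computes a distance directly from two location paths: the number of path components on each side that lie below the deepest common ancestor. The class LocationWeight and its methods are hypothetical names made up for this example; this is not the Hadoop NetworkTopology code or the committed fix, only a minimal model that reproduces the getWeight numbers.

```java
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of Hadoop.
final class LocationWeight {

    /** Splits "/pod1/rack1/node1" into ["pod1", "rack1", "node1"]. */
    static String[] components(String path) {
        return Arrays.stream(path.split("/"))
                .filter(s -> !s.isEmpty())
                .toArray(String[]::new);
    }

    /**
     * Distance between two full location paths (rack path plus node name):
     * the hops from each endpoint up to their deepest common ancestor.
     */
    static int weight(String client, String dataNode) {
        String[] a = components(client);
        String[] b = components(dataNode);

        // Count the leading path components shared by both locations.
        int common = 0;
        while (common < a.length && common < b.length
                && a[common].equals(b[common])) {
            common++;
        }
        return (a.length - common) + (b.length - common);
    }

    public static void main(String[] args) {
        String[][] pairs = {
            {"/rack1/node1", "/rack1/node1"},
            {"/rack1/node1", "/rack1/node2"},
            {"/rack1/node1", "/rack2/node2"},
            {"/pod1/rack1/node1", "/pod1/rack1/node2"},
            {"/pod1/rack1/node1", "/pod1/rack2/node2"},
            {"/pod1/rack1/node1", "/pod2/rack2/node2"},
        };
        for (String[] p : pairs) {
            System.out.println(p[0] + " " + p[1] + " " + weight(p[0], p[1]));
        }
    }
}
```

      Under this model, a same-rack DataNode (weight 2) always sorts ahead of an off-rack one (weight 4), which is the ordering a non-DataNode client needs to keep rack locality.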

            People

              Assignee: Owen O'Malley (omalley)
              Reporter: Owen O'Malley (omalley)
              Votes: 0
              Watchers: 2

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1.5h