Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-575

DFSClient read performance can be improved by stagerring connection setup to datanode(s)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • hdfs-client
    • None

    Description

      The DFS client opens a socket connection to a DN for the n-th block, fetches n-th block from that datanode and then opens socket connections to the datanode that contains the n+1th block. Sequential-reads might show performance improvements if the setting up of socket connections to the datanode containing the n+1th block can happen in parallel while the data for the nth block is being fetched. The amount of improvement, if any, has to be measured.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              dhruba Dhruba Borthakur
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated: