Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-922

Optimize small reads and seeks

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.10.1
    • 0.11.0
    • None
    • None

    Description

      A seek on a DFSInputStream causes causes the next read to re-open the socket connection to the datanode and fetch the remainder of the block all over again. This is not optimal.

      A small read followed by a small positive seek could re-utilize the data already fetched from the datanode as part of the previous read.

      Attachments

        1. smallreadseek4.patch
          3 kB
          Dhruba Borthakur

        Issue Links

          Activity

            People

              dhruba Dhruba Borthakur
              dhruba Dhruba Borthakur
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: