Hadoop Map/Reduce / MAPREDUCE-865

harchive: Reduce the number of open calls to _index and _masterindex

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: harchive
    • Labels: None

    Description

      When I have a har file with 1000 files in it, running
      % hadoop dfs -lsr har:///user/knoguchi/myhar.har/
      opens, reads, and closes the _index and _masterindex files 1000 times.

      This slows down the client and adds load to the namenode as well.
      Is there any way to reduce this number?
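
One possible direction, sketched below, is to parse the index once per archive and serve later lookups from memory. This is only an illustration of the caching idea, not the contents of the attached patch and not the actual HarFileSystem code. The class `HarIndexCache`, its `loader` function, and the string-valued index entries are all hypothetical stand-ins for "open, read, and close _index/_masterindex".

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

/**
 * Hypothetical sketch: cache a parsed archive index so that the index
 * files are read once per archive instead of once per file lookup.
 */
class HarIndexCache {
    // archive path -> (file name -> index entry); entries are plain strings here
    private final Map<String, Map<String, String>> cache = new HashMap<>();
    // Stand-in for the expensive open/read/close of _index and _masterindex
    private final Function<String, Map<String, String>> loader;
    private int loads = 0;

    HarIndexCache(Function<String, Map<String, String>> loader) {
        this.loader = loader;
    }

    /** Returns the index entry for a file, loading the archive index on first use only. */
    String lookup(String archive, String file) {
        Map<String, String> index = cache.get(archive);
        if (index == null) {
            index = loader.apply(archive); // the only place the index is read
            loads++;
            cache.put(archive, index);
        }
        return index.get(file);
    }

    /** Number of times the loader was actually invoked. */
    int loadCount() {
        return loads;
    }
}
```

With a cache like this, listing all 1000 files in one archive would invoke the loader once rather than 1000 times. A real fix would also need to decide when a cached index becomes stale, which this sketch ignores.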

      Attachments

        1. mapreduce-865-0.patch
          8 kB
          Koji Noguchi

            People

              Assignee: Koji Noguchi (knoguchi)
              Reporter: Koji Noguchi (knoguchi)
              Votes: 0
              Watchers: 5