Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-3289

Make use of fadvise in the NM's shuffle handler

VotersStop watchingWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      Using the new NativeIO fadvise functions, we can make the NodeManager prefetch map output before it's send over the socket, and drop it out of the fs cache once it's been sent (since it's very rare for an output to have to be re-sent). This improves IO efficiency and reduces cache pollution.

      Attachments

        1. mr-3289.txt
          5 kB
          Todd Lipcon
        2. MR3289_trunk.txt
          9 kB
          Siddharth Seth
        3. MR3289_trunk_3.txt
          13 kB
          Siddharth Seth
        4. MR3289_trunk_2.txt
          13 kB
          Siddharth Seth
        5. MAPREDUCE-3289.branch-1.patch
          4 kB
          Brandon Li
        6. MAPREDUCE-3289.branch-1.patch
          4 kB
          Brandon Li
        7. 3289-2.txt
          4 kB
          Todd Lipcon
        8. 3289-1.txt
          3 kB
          Todd Lipcon

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            tlipcon Todd Lipcon
            tlipcon Todd Lipcon
            Votes:
            0 Vote for this issue
            Watchers:
            20 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment