Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-3121

DFIP aka 'NodeManager should handle Disk-Failures In Place'

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Fixed
    • 0.23.0
    • 0.23.1
    • mrv2, nodemanager
    • None

    Description

      This is akin to MAPREDUCE-2413 but for YARN's NodeManager. We want to minimize the impact of transient/permanent disk failures on containers. With larger number of disks per node, the ability to continue to run containers on other disks is crucial.

      Attachments

        1. 3121.patch
          79 kB
          Ravi Gummadi
        2. 3121.v1.1.patch
          140 kB
          Ravi Gummadi
        3. 3121.v1.patch
          140 kB
          Ravi Gummadi
        4. 3121.v2.patch
          195 kB
          Ravi Gummadi
        5. 3121.v3.patch
          222 kB
          Ravi Gummadi

        Issue Links

          There are no Sub-Tasks for this issue.

          Activity

            People

              ravidotg Ravi Gummadi
              vinodkv Vinod Kumar Vavilapalli
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: