Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-3121

DFIP aka 'NodeManager should handle Disk-Failures In Place'

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.23.0
    • Fix Version/s: 0.23.1
    • Component/s: mrv2, nodemanager
    • Labels:
      None

      Description

      This is akin to MAPREDUCE-2413 but for YARN's NodeManager. We want to minimize the impact of transient/permanent disk failures on containers. With larger number of disks per node, the ability to continue to run containers on other disks is crucial.

        Attachments

        1. 3121.v3.patch
          222 kB
          Ravi Gummadi
        2. 3121.v2.patch
          195 kB
          Ravi Gummadi
        3. 3121.v1.patch
          140 kB
          Ravi Gummadi
        4. 3121.v1.1.patch
          140 kB
          Ravi Gummadi
        5. 3121.patch
          79 kB
          Ravi Gummadi

          Issue Links

            Activity

              People

              • Assignee:
                ravidotg Ravi Gummadi
                Reporter:
                vinodkv Vinod Kumar Vavilapalli
              • Votes:
                0 Vote for this issue
                Watchers:
                12 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: