Hadoop Map/Reduce / MAPREDUCE-2846

A small percentage of all tasks fail with DefaultTaskController

Details

    • Reviewed
    • Fixed a race condition in writing the log index file that caused tasks to 'fail'.

    Description

      After upgrading our test grid from 0.20.203 to 0.20.204-rc2, we ran terasort to verify operation. The job completed successfully, but approximately 10% of the tasks failed, reporting task runner execution errors and an inability to create symlinks for attempt logs.
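
      The release note attributes these failures to a race condition in writing the log index file. As a rough illustration of that class of bug and one common way to avoid it, here is a minimal sketch in Python. It is not Hadoop's Java code and does not reproduce the contents of sync.patch; the names `write_index_file`, `read_index_file`, and `_index_lock` are hypothetical, and the `key:value` line format is an assumption made for the example. The idea is to serialize writers with a lock and publish the file with an atomic rename, so a reader never sees a half-written index.

```python
import os
import tempfile
import threading

# Hypothetical lock: serializes all index-file writers within this process.
_index_lock = threading.Lock()


def write_index_file(path, entries):
    """Write entries (a dict of name -> value) to path without exposing partial content."""
    directory = os.path.dirname(os.path.abspath(path))
    with _index_lock:
        # Create the temporary file in the target directory so the final
        # rename stays on one filesystem.
        fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".index-", suffix=".tmp")
        try:
            with os.fdopen(fd, "w") as tmp:
                for key, value in sorted(entries.items()):
                    tmp.write(f"{key}:{value}\n")
                tmp.flush()
                os.fsync(tmp.fileno())
            # os.replace swaps the new file into place in one step, so readers
            # see either the old content or the new content.
            os.replace(tmp_path, path)
        except BaseException:
            # Remove the temporary file if anything failed before the rename.
            if os.path.exists(tmp_path):
                os.unlink(tmp_path)
            raise


def read_index_file(path):
    """Parse the key:value lines written by write_index_file back into a dict."""
    result = {}
    with open(path) as f:
        for line in f:
            key, _, value = line.rstrip("\n").partition(":")
            result[key] = value
    return result
```

      With several threads calling `write_index_file` on the same path, the file that remains is always one complete set of entries. Without the lock and the rename, two writers could interleave and leave a file that no reader can parse, which is the kind of symptom that can make an otherwise healthy task look failed.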

      Attachments

        1. sync.patch
          0.9 kB
          Owen O'Malley
        2. sync-trunk.patch
          1 kB
          Owen O'Malley

            People

              omalley Owen O'Malley
              aw Allen Wittenauer
              Votes: 0
              Watchers: 4
