Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.22.0
    • Fix Version/s: 0.23.3, 2.0.2-alpha, 1.1.2
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Some very small percentage of tasks fail with a "Text file busy" error.

      The following was the original diagnosis:

      Our use of PrintWriter in TaskController.writeCommand is unsafe, since that class swallows all IO exceptions. We're not currently checking for errors, which I'm seeing result in occasional task failures with the message "Text file busy" - assumedly because the close() call is failing silently for some reason.

      .. but turned out to be another issue as well (see below)

      1. successfull_taskjvmsh.strace
        33 kB
        Shrinivas Joshi
      2. mapreduce-2374-on-20sec.txt
        2 kB
        Todd Lipcon
      3. mapreduce-2374-branch-1.patch
        3 kB
        Shrinivas Joshi
      4. mapreduce-2374-2.txt
        1 kB
        Andy Isaacson
      5. mapreduce-2374.txt
        0.9 kB
        Andy Isaacson
      6. mapreduce-2374.txt
        0.9 kB
        Andy Isaacson
      7. mapreduce-2374.txt
        1 kB
        Andy Isaacson
      8. failed_taskjvmsh.strace
        10 kB
        Shrinivas Joshi

        Issue Links

          Activity

            People

            • Assignee:
              Andy Isaacson
              Reporter:
              Todd Lipcon
            • Votes:
              0 Vote for this issue
              Watchers:
              24 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development