Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.22.0
    • Fix Version/s: 0.23.3, 2.0.2-alpha, 1.1.2
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Some very small percentage of tasks fail with a "Text file busy" error.

      The following was the original diagnosis:

      Our use of PrintWriter in TaskController.writeCommand is unsafe, since that class swallows all IO exceptions. We're not currently checking for errors, which I'm seeing result in occasional task failures with the message "Text file busy" - assumedly because the close() call is failing silently for some reason.

      .. but turned out to be another issue as well (see below)

        Attachments

        1. failed_taskjvmsh.strace
          10 kB
          Shrinivas Joshi
        2. mapreduce-2374.txt
          1 kB
          Andy Isaacson
        3. mapreduce-2374.txt
          0.9 kB
          Andy Isaacson
        4. mapreduce-2374.txt
          0.9 kB
          Andy Isaacson
        5. mapreduce-2374-2.txt
          1 kB
          Andy Isaacson
        6. mapreduce-2374-branch-1.patch
          3 kB
          Shrinivas Joshi
        7. mapreduce-2374-on-20sec.txt
          2 kB
          Todd Lipcon
        8. successfull_taskjvmsh.strace
          33 kB
          Shrinivas Joshi

          Issue Links

            Activity

              People

              • Assignee:
                adi2 Andy Isaacson
                Reporter:
                tlipcon Todd Lipcon
              • Votes:
                0 Vote for this issue
                Watchers:
                23 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: