Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-5044

Have AM trigger jstack on task attempts that timeout before killing them

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 2.1.0-beta
    • 2.8.0, 3.0.0-alpha1
    • mr-am
    • None
    • Reviewed

    Description

      When an AM expires a task attempt it would be nice if it triggered a jstack output via SIGQUIT before killing the task attempt. This would be invaluable for helping users debug their hung tasks, especially if they do not have shell access to the nodes.

      Attachments

        1. MAPREDUCE-5044.v01.patch
          67 kB
          Gera Shegalov
        2. Screen Shot 2013-11-12 at 1.05.32 PM.png
          40 kB
          Gera Shegalov
        3. Screen Shot 2013-11-12 at 1.06.04 PM.png
          191 kB
          Gera Shegalov
        4. MAPREDUCE-5044.v02.patch
          11 kB
          Gera Shegalov
        5. MAPREDUCE-5044.v03.patch
          6 kB
          Gera Shegalov
        6. MAPREDUCE-5044.v04.patch
          12 kB
          Gera Shegalov
        7. MAPREDUCE-5044.v05.patch
          15 kB
          Gera Shegalov
        8. MAPREDUCE-5044.v06.patch
          16 kB
          Gera Shegalov
        9. MAPREDUCE-5044.v07.local.patch
          35 kB
          Eric Payne
        10. MAPREDUCE-5044.008.patch
          35 kB
          Eric Payne
        11. MAPREDUCE-5044.009.patch
          35 kB
          Eric Payne
        12. MAPREDUCE-5044.010.patch
          42 kB
          Eric Payne
        13. MAPREDUCE-5044.011.patch
          58 kB
          Eric Payne
        14. MAPREDUCE-5044.012.patch
          60 kB
          Eric Payne
        15. MAPREDUCE-5044.013.patch
          60 kB
          Eric Payne

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            epayne Eric Payne
            jlowe Jason Darrell Lowe
            Votes:
            0 Vote for this issue
            Watchers:
            23 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment