Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-2952

Application failure diagnostics are not consumed in a couple of cases

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Blocker Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.23.0
    • Fix Version/s: 0.23.0
    • Component/s: mrv2, resourcemanager
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      When Container crashes, the reason for failures isn't propagated because of a bug in RMAppAttemptImpl.AMContainerCrashedTransition which simply discards the diagnostics of the container. Also RMAppAttemptImpl.diagnostics is never consumed.

      1. MAPREDUCE-2952.patch
        8 kB
        Arun C Murthy
      2. MAPREDUCE-2952.patch
        31 kB
        Arun C Murthy
      3. MAPREDUCE-2952.patch
        51 kB
        Arun C Murthy
      4. MAPREDUCE-2952.patch
        75 kB
        Arun C Murthy
      5. MAPREDUCE-2952.patch
        79 kB
        Arun C Murthy

        Issue Links

          Activity

          Vinod Kumar Vavilapalli created issue -
          Vinod Kumar Vavilapalli made changes -
          Field Original Value New Value
          Fix Version/s 0.23.0 [ 12315570 ]
          Component/s client [ 12312982 ]
          Component/s mrv2 [ 12314301 ]
          Component/s resourcemanager [ 12315340 ]
          Vinod Kumar Vavilapalli made changes -
          Link This issue is blocked by MAPREDUCE-2937 [ MAPREDUCE-2937 ]
          Vinod Kumar Vavilapalli made changes -
          Summary JobClient doesn't show the reason why AM crashed Application failure diagnostics are not consumed in a couple of cases
          Description When MR ApplicationMaster crashes, say because of a localization error, NM promptly reports the exception trace to the RM via diagnostics. Client eventually figures out that the application has failed, but it never prints the reason.

          The reason for failure is already propagated to the client in most cases via _ApplicationReport_ but it isn't printed/logged.
          When Container crashes, the reason for failures isn't propagated because of a bug in _RMAppAttemptImpl.AMContainerCrashedTransition_ which simply discards the diagnostics of the container. Also RMAppAttemptImpl.diagnostics is never consumed.
          Component/s client [ 12312982 ]
          Arun C Murthy made changes -
          Priority Major [ 3 ] Blocker [ 1 ]
          Arun C Murthy made changes -
          Assignee Arun C Murthy [ acmurthy ]
          Arun C Murthy made changes -
          Attachment MAPREDUCE-2952.patch [ 12496052 ]
          Arun C Murthy made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Arun C Murthy made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Arun C Murthy made changes -
          Attachment MAPREDUCE-2952.patch [ 12496248 ]
          Arun C Murthy made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Vinod Kumar Vavilapalli made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Arun C Murthy made changes -
          Attachment MAPREDUCE-2952.patch [ 12496298 ]
          Arun C Murthy made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Arun C Murthy made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Arun C Murthy made changes -
          Attachment MAPREDUCE-2952.patch [ 12496353 ]
          Arun C Murthy made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Arun C Murthy made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Arun C Murthy made changes -
          Attachment MAPREDUCE-2952.patch [ 12496357 ]
          Arun C Murthy made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Vinod Kumar Vavilapalli made changes -
          Status Patch Available [ 10002 ] Resolved [ 5 ]
          Hadoop Flags [Reviewed]
          Resolution Fixed [ 1 ]
          Arun C Murthy made changes -
          Status Resolved [ 5 ] Closed [ 6 ]

            People

            • Assignee:
              Arun C Murthy
              Reporter:
              Vinod Kumar Vavilapalli
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development