Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-3201

Even though jobs are getting failed on particular NM, it is not getting blacklisted

    Details

    • Type: Bug Bug
    • Status: Resolved
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: 0.23.0
    • Fix Version/s: None
    • Component/s: mrv2
    • Labels:
      None

      Description

      The yarnchild process on a particular NM are getting killed continuosly. 
      Still the NM is not getting blacklisted
      

        Activity

        Hide
        Vinod Kumar Vavilapalli added a comment -

        Ramgopal, we do have job-level blacklist today. Can you look at your AM logs and grep for the following log messsages? Thanks.

        nodeBlacklistingEnabled:
        maxTaskFailuresPerNode is
        failures on node
        Blacklisted host

        There is a known bug related to NM blacklisting by MR jobs - MAPREDUCE-2693, but doesn't look like you are running into that.

        Show
        Vinod Kumar Vavilapalli added a comment - Ramgopal, we do have job-level blacklist today. Can you look at your AM logs and grep for the following log messsages? Thanks. nodeBlacklistingEnabled: maxTaskFailuresPerNode is failures on node Blacklisted host There is a known bug related to NM blacklisting by MR jobs - MAPREDUCE-2693 , but doesn't look like you are running into that.
        Hide
        Ravi Prakash added a comment -

        Please reopen if this is still an issue

        Show
        Ravi Prakash added a comment - Please reopen if this is still an issue

          People

          • Assignee:
            Unassigned
            Reporter:
            Ramgopal N
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development