Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-6111

CLONE - Map tasks not local to RS

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • 0.20.2, 0.90.4
    • None
    • master, regionserver
    • None
    • DN, TT and RS running on the same nodes, all using CDH3. Ganglia monitoring everything.

    Description

      I have started seeing this issue in our environment. HBASE-1672 was closed as non reproducible, so I cloned it here.

      I have a 367M record table, compressed with snappy, and running a vanilla MR SCAN with no filters spawns 441 Mappers. The cluster currently has 216 slots for mappers, and the first wave all report 100% data-local mappers. As the second wave of mappers come up they don't get run locally to the RS and data locality drops.

      This kills our environment, as it saturates the network at 120M which is very clear on ganglia.

      I am really happy to help diagnose this, but need some guidance on what to do. I don't know enough yet about how task assignment works in MR to determine why the machines are picking up random tasks for their second effort and not one for the local RS.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            timrobertson100 Tim Robertson
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment