Hadoop Map/Reduce / MAPREDUCE-2025

Improve job counter wire protocol with incremental schema and delta values


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.20.2
    • Fix Version/s: None
    • Component/s: client, jobtracker, task
    • Labels: None
    • Tags: counter

    Description

      The current job counter wire protocol is verbose: each update sends string key and vlong value pairs, which is costly when keys are long. An incremental protocol that sends the schema (key to key id mapping) only as it grows, and the data as vint key id to vlong value delta pairs, can significantly lower the counter overhead. That in turn would let us raise the counter update frequency above the current 1 update/minute. This jira tracks the improvement for user counters, i.e. counters defined by apps or tools (pig etc.) built on top of the mr framework. The improvement for framework (mr/hdfs etc.) counters is tracked by MAPREDUCE-901; that is the more common case and offers more optimization opportunities because its schema is fixed.
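      A rough sketch of the idea, not a proposed patch: the sender assigns each new counter key a small integer id and ships the key string once, then sends only (key id, value delta) pairs for counters that changed. Everything here is an assumption for illustration: the class and method names (DeltaCounterSketch, Sender, Receiver) are hypothetical, the message layout is made up, and the varint below is a generic zigzag/LEB128-style encoding standing in for Hadoop's own vint/vlong format.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class DeltaCounterSketch {

  // Generic variable-length encoding (7 bits per byte); NOT Hadoop's vlong format.
  static void writeVarLong(DataOutputStream out, long v) throws IOException {
    while ((v & ~0x7FL) != 0) {
      out.writeByte((int) ((v & 0x7F) | 0x80));
      v >>>= 7;
    }
    out.writeByte((int) v);
  }

  static long readVarLong(DataInputStream in) throws IOException {
    long result = 0;
    int shift = 0;
    int b;
    do {
      b = in.readUnsignedByte();
      result |= (long) (b & 0x7F) << shift;
      shift += 7;
    } while ((b & 0x80) != 0);
    return result;
  }

  // Zigzag mapping so that small negative deltas also encode in few bytes.
  static long zigzag(long v) { return (v << 1) ^ (v >> 63); }
  static long unzigzag(long n) { return (n >>> 1) ^ -(n & 1); }

  /** Task side: remembers which keys and values were already sent. */
  static class Sender {
    private final Map<String, Integer> ids = new HashMap<>();
    private final Map<String, Long> lastSent = new HashMap<>();

    byte[] encode(Map<String, Long> current) throws IOException {
      List<String> newKeys = new ArrayList<>();
      List<long[]> deltas = new ArrayList<>();
      for (Map.Entry<String, Long> e : current.entrySet()) {
        String key = e.getKey();
        Integer known = ids.get(key);
        int id;
        if (known == null) {          // first time we see this key: extend the schema
          id = ids.size();
          ids.put(key, id);
          newKeys.add(key);
        } else {
          id = known;
        }
        long delta = e.getValue() - lastSent.getOrDefault(key, 0L);
        if (delta != 0) {             // unchanged counters cost nothing on the wire
          deltas.add(new long[] {id, delta});
          lastSent.put(key, e.getValue());
        }
      }
      ByteArrayOutputStream buf = new ByteArrayOutputStream();
      DataOutputStream out = new DataOutputStream(buf);
      writeVarLong(out, newKeys.size());      // incremental schema: new keys only
      for (String k : newKeys) out.writeUTF(k);
      writeVarLong(out, deltas.size());       // data: (key id, value delta) pairs
      for (long[] d : deltas) {
        writeVarLong(out, d[0]);
        writeVarLong(out, zigzag(d[1]));
      }
      out.flush();
      return buf.toByteArray();
    }
  }

  /** Receiving side: rebuilds the id-to-key table and accumulates deltas. */
  static class Receiver {
    private final List<String> names = new ArrayList<>();
    final Map<String, Long> totals = new HashMap<>();

    void apply(byte[] update) throws IOException {
      DataInputStream in = new DataInputStream(new ByteArrayInputStream(update));
      long newKeys = readVarLong(in);
      for (long i = 0; i < newKeys; i++) names.add(in.readUTF()); // id = list index
      long pairs = readVarLong(in);
      for (long i = 0; i < pairs; i++) {
        String key = names.get((int) readVarLong(in));
        totals.merge(key, unzigzag(readVarLong(in)), Long::sum);
      }
    }
  }
}
```

      In this sketch the first update pays for the key strings, while later updates carry only a few bytes per changed counter and nothing for unchanged ones. It relies on updates being applied in order and exactly once; a real protocol would need to handle lost or replayed updates.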

          People

            Assignee: Luke Lu (vicaya)
            Reporter: Luke Lu (vicaya)
