Hadoop Map/Reduce
MAPREDUCE-5026

Decouple statistics collection from the TaskTracker heartbeat to shorten heartbeat time

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.1.1
    • Fix Version/s: 1.1.1
    • Component/s: performance, tasktracker
    • Labels:

      Description

      On every heartbeat, the TaskTracker computes several system statistics, such as free disk space, available virtual and physical memory, and CPU usage. It is not necessary to recompute all of these statistics on every heartbeat; doing so consumes system resources and degrades the performance of the TaskTracker heartbeat. Furthermore, disk, memory, and CPU have different characteristics, so it is better to collect their statistics at different intervals.

      To reduce the latency of the TaskTracker heartbeat, one solution is to decouple all system statistics collection from it and to start separate threads that do the collection when the TaskTracker starts. There could be three threads: the first collects CPU statistics at a short interval, the second collects memory statistics at a medium interval, and the third collects disk statistics at a long interval. All of the intervals could be customized through the "mapred.stats.collection.interval" parameter in mapred-site.xml. The heartbeat could then read the system statistics directly from memory.
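
      The proposal above can be illustrated with a minimal Java sketch. It is not the attached patch and does not use the TaskTracker API: the class DecoupledStatsCollector, its methods, and the probe parameters are hypothetical names chosen for illustration. The intervals are passed in as plain arguments because the issue does not say how "mapred.stats.collection.interval" would map to the three collectors.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.LongSupplier;

// Hypothetical sketch; none of these names come from Hadoop or the attached patch.
final class DecoupledStatsCollector {
    // Latest samples. volatile makes each write visible to the heartbeat thread.
    private volatile long cpuUsagePercent;
    private volatile long availableMemoryBytes;
    private volatile long freeDiskBytes;

    // Up to three daemon worker threads, so the collectors do not block one another.
    private final ScheduledExecutorService scheduler =
        Executors.newScheduledThreadPool(3, runnable -> {
            Thread thread = new Thread(runnable, "stats-collector");
            thread.setDaemon(true);
            return thread;
        });

    // Starts the three collectors. Probes and intervals are supplied by the caller.
    void start(LongSupplier cpuProbe, long cpuIntervalMs,
               LongSupplier memoryProbe, long memoryIntervalMs,
               LongSupplier diskProbe, long diskIntervalMs) {
        scheduler.scheduleWithFixedDelay(
            () -> { cpuUsagePercent = cpuProbe.getAsLong(); },
            0, cpuIntervalMs, TimeUnit.MILLISECONDS);
        scheduler.scheduleWithFixedDelay(
            () -> { availableMemoryBytes = memoryProbe.getAsLong(); },
            0, memoryIntervalMs, TimeUnit.MILLISECONDS);
        scheduler.scheduleWithFixedDelay(
            () -> { freeDiskBytes = diskProbe.getAsLong(); },
            0, diskIntervalMs, TimeUnit.MILLISECONDS);
    }

    // Heartbeat path: reads the cached values only and never runs a probe.
    // Returns {cpuUsagePercent, availableMemoryBytes, freeDiskBytes}.
    long[] snapshot() {
        return new long[] {cpuUsagePercent, availableMemoryBytes, freeDiskBytes};
    }

    // Stops all collectors, for example when the TaskTracker shuts down.
    void stop() {
        scheduler.shutdownNow();
    }
}
```

      In this sketch the heartbeat would call snapshot() instead of probing the system itself. The trade-off is that each reported value can be stale by up to its own collection interval, and the values read as zero until the first sample has been taken.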

      1. HDFS-4527.patch
        6 kB
        sam liu
      2. HDFS-4527.patch
        6 kB
        sam liu

        Activity

        sam liu created issue -
        sam liu made changes -
        Field Original Value New Value
        Status Open [ 1 ] Patch Available [ 10002 ]
        Fix Version/s 1.1.1 [ 12321656 ]
        sam liu made changes -
        Attachment HDFS-4527.patch [ 12570641 ]
        sam liu made changes -
        Status Patch Available [ 10002 ] Open [ 1 ]
        Andrew Wang made changes -
        Project Hadoop HDFS [ 12310942 ] Hadoop Map/Reduce [ 12310941 ]
        Key HDFS-4527 MAPREDUCE-5026
        Affects Version/s 1.1.1 [ 12321660 ]
        Affects Version/s 1.1.1 [ 12321656 ]
        Target Version/s 1.1.1 [ 12321656 ]
        Fix Version/s 1.1.1 [ 12321660 ]
        Fix Version/s 1.1.1 [ 12321656 ]
        Component/s performance [ 12316500 ]
        Component/s tasktracker [ 12312906 ]
        Component/s performance [ 12316501 ]
        sam liu made changes -
        Attachment HDFS-4527.patch [ 12574043 ]

          People

          • Assignee: Unassigned
          • Reporter: sam liu
          • Votes: 0
          • Watchers: 3

            Dates

            • Created:
              Updated:

              Time Tracking

              Estimated: 24h
              Remaining: 24h
              Logged: Not Specified
