HBASE-745

Scaling of one regionserver, improving memory and CPU usage

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.1.3, 0.2.0
    • Fix Version/s: 0.2.0
    • Component/s: regionserver
    • Labels: None
    • Environment: hadoop 0.17.1

      Description

      After weeks of testing hbase 0.1.3 with hadoop (0.16.4, 0.17.1), I found that there is a lot of work to do before a single regionserver can handle about 100 GB of data, or even more. I'd like to share my opinions here with stack and the other developers.

      First, the easiest way to improve the scalability of a regionserver is to upgrade the hardware: use a 64-bit OS with 8 GB of memory for the regionserver process, and speed up disk I/O.

      Besides hardware, the following are the software bottlenecks I found in the regionserver:
      1. As data grows, compaction eats CPU (and I/O) time. The total compaction time is basically linear in the total data size and, even worse, sometimes quadratic in that size.
      2. Memory usage depends on the opened mapfiles.
      3. Network connections also depend on the opened mapfiles; see HADOOP-2341 and HBASE-24.
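To make the first bottleneck concrete, here is a rough cost model of how compaction work could grow with data size. It is only a sketch under stated assumptions, not the regionserver's actual compaction policy: the function names, the 64 MB flush size, and the two simplified policies (rewrite the whole store after every flush vs. rewrite each flush exactly once) are all hypothetical and chosen for illustration.

```python
def rewrite_all_cost(num_flushes: int, flush_mb: int) -> int:
    """MB rewritten if every flush triggers a compaction of the whole store.

    Hypothetical worst case: the k-th compaction rewrites k * flush_mb,
    so the total grows quadratically with the amount of data.
    """
    total_mb = 0
    store_mb = 0
    for _ in range(num_flushes):
        store_mb += flush_mb   # a new flush file joins the store
        total_mb += store_mb   # the compaction rewrites everything held so far
    return total_mb


def rewrite_once_cost(num_flushes: int, flush_mb: int) -> int:
    """MB rewritten if each flush file is compacted exactly once (linear)."""
    return num_flushes * flush_mb


if __name__ == "__main__":
    # Assumed flush size of 64 MB; doubling the data from 10 to 20 flushes.
    for n in (10, 20):
        print(n, rewrite_once_cost(n, 64), rewrite_all_cost(n, 64))
```

In this model, doubling the data doubles the work for the rewrite-once policy but nearly quadruples it for the rewrite-everything policy, which is one way the "sometimes quadratic" behaviour described above could arise.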

      1. HBASE-745.compact.patch
        3 kB
        Luo Ning
      2. hbase-745-for-0.2.patch
        4 kB
        Izaak Rubin


            People

            • Assignee:
              Unassigned
              Reporter:
              Luo Ning