HBase / HBASE-11985

Document sizing rules of thumb


Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0
    • Fix Version/s: 2.0.0
    • Component/s: documentation
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      I'm looking for tuning/sizing rules of thumb to put in the Ref Guide.

      Info I have gleaned so far:

      A reasonable region size is between 10 GB and 50 GB.
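
      To make the region size range concrete, here is a minimal sketch that turns a total data size into a rough region count. The function name and the 2 TB example are illustrative assumptions, not anything taken from HBase itself; only the 10 GB and 50 GB bounds come from the note above.

```python
GB = 1024 ** 3


def region_count_range(total_bytes, min_region_bytes=10 * GB, max_region_bytes=50 * GB):
    """Return (fewest, most) regions needed to hold total_bytes.

    The fewest regions result from the largest region size, and the most
    regions from the smallest. Integer ceiling division avoids float rounding.
    """
    fewest = -(-total_bytes // max_region_bytes)
    most = -(-total_bytes // min_region_bytes)
    return fewest, most


# Hypothetical example: a table holding 2 TB of data.
fewest, most = region_count_range(2 * 1024 * GB)
print(f"between {fewest} and {most} regions")
```

      This ignores compression, replication, and uneven key distribution, so treat the result as a starting point only.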

      A reasonable maximum cell size is 1 MB to 10 MB. If your cells are larger than 10 MB, consider storing the cell contents in HDFS and storing a reference to the location in HBase. The pending MOB work targets the 10 MB to 64 MB window.
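
      The cell size guidance could be expressed as a small decision helper. This is only a sketch: the function name, the return labels, and the `mob_available` flag are hypothetical, and the thresholds are simply the 10 MB and 64 MB figures mentioned above.

```python
MB = 1024 ** 2


def suggest_storage(cell_bytes, mob_available=False):
    """Suggest where a value of cell_bytes might live (illustrative labels only)."""
    if cell_bytes <= 10 * MB:
        # Within the suggested maximum cell size: store it directly in HBase.
        return "hbase-cell"
    if mob_available and cell_bytes <= 64 * MB:
        # The 10 MB to 64 MB window that the MOB work is meant to cover.
        return "mob"
    # Otherwise keep the bytes in HDFS and store only a reference in HBase.
    return "hdfs-reference"


print(suggest_storage(2 * MB))
print(suggest_storage(20 * MB))
print(suggest_storage(20 * MB, mob_available=True))
```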

      When you size your regions and cells, keep in mind that a region cannot be split in the middle of a row. If your rows are too large, or your regions too small, you can end up with a single row per region, which is not a good pattern. It is also possible for one big column to trigger splits while the other columns are tiny, which is also undesirable.
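
      A quick way to sanity-check the row size against the region size is to estimate how many rows fit in one region. The helper below is a hypothetical sketch, and the row sizes in the example are made up for illustration.

```python
GB = 1024 ** 3
MB = 1024 ** 2


def rows_per_region(region_bytes, avg_row_bytes):
    """Estimate how many whole rows fit in a region of region_bytes."""
    if avg_row_bytes <= 0:
        raise ValueError("avg_row_bytes must be positive")
    return region_bytes // avg_row_bytes


# 1 MB rows in a 10 GB region leave plenty of rows per region.
print(rows_per_region(10 * GB, 1 * MB))

# 8 GB rows in a 10 GB region degenerate to one row per region,
# which is the pattern the paragraph above warns about.
print(rows_per_region(10 * GB, 8 * GB))
```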

      A large number of columns probably means you are doing it wrong.

      Column names need to be short because they are stored with every value (barring encoding). They don't need to be self-documenting as in an RDBMS.
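
      The cost of long column names can be estimated with a rough per-cell size calculation. The sketch below assumes the unencoded KeyValue layout (length fields, row key, family, qualifier, timestamp, type, value) and ignores tags, block encoding, and compression. The row key, family, and qualifier names are made up for illustration.

```python
# Assumed fixed bytes per cell: key length (4) + value length (4) +
# row length (2) + family length (1) + timestamp (8) + key type (1).
FIXED_OVERHEAD = 4 + 4 + 2 + 1 + 8 + 1


def cell_bytes(row_key, family, qualifier, value_len):
    """Approximate stored size of one cell, before encoding or compression."""
    return FIXED_OVERHEAD + len(row_key) + len(family) + len(qualifier) + value_len


# Same 8-byte value stored under a descriptive and a short qualifier.
long_name = cell_bytes(b"user12345", b"d", b"last_login_timestamp", 8)
short_name = cell_bytes(b"user12345", b"d", b"ll", 8)

saved = long_name - short_name
print(f"{long_name} vs {short_name} bytes per cell, {saved} bytes saved")
```

      The saving is per cell, so it scales with the number of rows and versions that carry the column.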

      Attachments

        1. HBASE-11985.patch
          3 kB
          M Linville

          People

            Assignee: misty M Linville
            Reporter: misty M Linville
            Votes: 0
            Watchers: 8
