Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      I have an HBase table where items are keyed by a sequential row id. When doing bulk appends to this table, the last region of the table gets pretty large until enough compactions can be done to sort everything out.

      In this case, it would be better if the region serving the largest keys split on the last key rather than at the midkey. One way to implement this: when the top region reaches MAX_REGION_SIZE/2, create a new region, with the lower half getting all the data and the top half empty. For bulk sequential inserts, this should avoid the need for any compactions.
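
      To make the proposal concrete, here is a rough sketch of the split-point choice in plain Java. It does not use any HBase classes: `LastKeySplitSketch`, `chooseSplitKey`, the 256 MB value for `MAX_REGION_SIZE`, and the use of the middle entry of the key list as the "midkey" are all assumptions made for illustration. It also assumes a region's start key is inclusive and its end key exclusive, so the split key for the top region has to sort just after the last key for the lower half to keep all the data.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical sketch, not HBase code: illustrates splitting the top region
// at its last key instead of at the midkey.
final class LastKeySplitSketch {

    // Illustrative value only; stands in for the MAX_REGION_SIZE in the description.
    static final long MAX_REGION_SIZE = 256L * 1024 * 1024;

    /**
     * Returns the key at which the region should split, or empty if no split is due.
     * sortedKeys must be in ascending row-key order.
     */
    static Optional<byte[]> chooseSplitKey(List<byte[]> sortedKeys,
                                           long regionSizeBytes,
                                           boolean servesLargestKeys) {
        if (sortedKeys.isEmpty()) {
            return Optional.empty();
        }
        if (servesLargestKeys) {
            // Proposed rule: split the top region early, at half the usual limit.
            if (regionSizeBytes < MAX_REGION_SIZE / 2) {
                return Optional.empty();
            }
            // Smallest key that sorts strictly after the last key: append a zero byte.
            // The lower half keeps every existing row; the upper half starts empty.
            byte[] lastKey = sortedKeys.get(sortedKeys.size() - 1);
            return Optional.of(Arrays.copyOf(lastKey, lastKey.length + 1));
        }
        // Assumed behaviour for every other region: split at the midkey once full.
        if (regionSizeBytes < MAX_REGION_SIZE) {
            return Optional.empty();
        }
        return Optional.of(sortedKeys.get(sortedKeys.size() / 2));
    }
}
```

      With sequential row ids, new appends would then land in the empty upper region, which is what lets the lower region stay untouched.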

        Issue Links

          Activity

          Ben Maurer created issue -
          Evgeny Ryabitskiy made changes -
          Field Original Value New Value
          Link This issue is related to HBASE-1261 [ HBASE-1261 ]
          Evgeny Ryabitskiy made changes -
          Link This issue is related to HBASE-1261 [ HBASE-1261 ]
          Evgeny Ryabitskiy made changes -
          Link This issue relates to HBASE-1261 [ HBASE-1261 ]
          stack added a comment -

          Nice idea. If a patch comes in before 0.20.0 is cut, will include. Meantime moving this out of 0.20.0.

          stack made changes -
          Fix Version/s 0.20.0 [ 12313474 ]
          Andrew Purtell added a comment -

          Superseded by a lot of work on pluggable policies.

          Andrew Purtell made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Resolution Not a Problem [ 8 ]
          Transition: Open to Resolved
          Time In Source Status: 1956d 20h 42m
          Execution Times: 1
          Last Executer: Andrew Purtell
          Last Execution Date: 16/Jul/14 22:46

            People

            • Assignee:
              Unassigned
              Reporter:
              Ben Maurer
            • Votes:
              0
              Watchers:
              3

              Dates

              • Created:
                Updated:
                Resolved:
