HBase / HBASE-3327

For increment workloads, retain memstores in memory after flushing them

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: regionserver
    • Labels: None

    Description

      This is an improvement based on our observation of what happens in an increment workload. The working set is typically small and is contained in the memstores.
      1. The memstores get flushed because the limit on the number of WAL files is hit.
      2. The flushes in turn trigger compactions, which evict blocks from the block cache.
      3. With the memstore flushed and the block cache evicted, the data is no longer in memory, so increments arriving afterwards have to read from disk.

      We could solve this elegantly by retaining the memstores AFTER they are flushed into files. The new memstore could then be populated quickly with the working set directly from memory, without hitting disk. We can throttle either the number of retained memstores or the memory allocated to them. In fact, allocating a percentage of the block cache to them should give a large boost for this workload.
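
      A minimal sketch of the idea, to make the proposal concrete. This is not HBase's MemStore code: the class RetainedSnapshots, its methods, and the String-to-Long cell model are all hypothetical, and the byte sizes are supplied by the caller rather than measured. It keeps flushed memstores in memory under a byte budget, evicts the oldest first, and answers reads from the newest snapshot that contains the key.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch: flushed memstores retained in memory under a byte budget. */
public class RetainedSnapshots {

    /** One flushed memstore plus the caller's estimate of its heap size. */
    private static final class Snapshot {
        final Map<String, Long> cells;
        final long sizeBytes;

        Snapshot(Map<String, Long> cells, long sizeBytes) {
            this.cells = cells;
            this.sizeBytes = sizeBytes;
        }
    }

    // Newest snapshot at the head, oldest at the tail.
    private final Deque<Snapshot> snapshots = new ArrayDeque<>();
    private final long maxBytes;
    private long usedBytes;

    public RetainedSnapshots(long maxBytes) {
        this.maxBytes = maxBytes;
    }

    /**
     * Call after a flush has completed. Eviction always removes the oldest
     * snapshot first, so an older value is never kept while a newer one for
     * the same key is dropped. A snapshot larger than the whole budget
     * evicts everything, itself included.
     */
    public synchronized void retain(Map<String, Long> flushedCells, long sizeBytes) {
        snapshots.addFirst(new Snapshot(new TreeMap<>(flushedCells), sizeBytes));
        usedBytes += sizeBytes;
        while (usedBytes > maxBytes) {
            usedBytes -= snapshots.removeLast().sizeBytes;
        }
    }

    /** Returns the most recently flushed value, or null if a disk read is needed. */
    public synchronized Long get(String key) {
        for (Snapshot s : snapshots) { // iterates newest to oldest
            Long value = s.cells.get(key);
            if (value != null) {
                return value;
            }
        }
        return null;
    }

    public synchronized long usedBytes() {
        return usedBytes;
    }
}
```

      Under these assumptions, an increment that misses the active memstore would call get() first and fall back to the store files only on null. The maxBytes budget corresponds to the throttle described above; carving it out of the block cache would be a separate sizing decision.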

            People

              Assignee: Unassigned
              Reporter: Karthik Ranganathan (karthik.ranga)
              Votes: 2
              Watchers: 6
