HBase / HBASE-6800

Build a Document Store on HBase for Better Query Processing

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.95.2
    • Fix Version/s: None
    • Component/s: Coprocessors, Performance
    • Labels:
      None

      Description

      Over the last couple of years, more and more people have begun streaming data into HBase in near real time and using high-level queries (e.g., Hive) to analyze that data directly in HBase. While HBase already has very effective MapReduce integration and good scanning performance, query processing using MapReduce on HBase still shows significant gaps compared to HDFS: ~3x space overhead and 3~5x performance overhead according to our measurements.

      We propose to implement a document store on HBase, which can greatly improve query processing on HBase by leveraging the relational model and read-mostly access patterns. According to our prototype, it can reduce space usage by up to ~3x and speed up query processing by up to ~1.8x.

      1. dot-deisgn.pdf
        353 kB
        Jason Dai

            People

            • Assignee:
              Unassigned
              Reporter:
              Jason Dai
            • Votes:
              1
              Watchers:
              30

              Dates

              • Created:
                Updated:
