Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-8016

HBase as an embeddable library, but still using HDFS

    XMLWordPrintableJSON

Details

    • Wish
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • None
    • None
    • None
    • None

    Description

      This goes in the "strange idea" bucket...

      I'm looking for a tool to allow folks to store key-value data into HDFS so that hadoop companion layers & apps don't need to rely either on external database or a NoSQL store. HBase itself is often not running on such clusters and we can not add it as a requirement for many of the use cases I'm considering.

      But... what if we produced a library that provided the basic HBase API (creating tables & putting / getting values...) and this library was pointed at HDFS for durability. This library would effectively embed a region server and the the master in a node and provide only API level access within that JVM. We would skip marshaling & networking, gaining a fair amount of efficiency. An application using this library would gain all of the advantages of HBase without adding any additional administrative complexity of managing HBase as a distributed service.

      Thoughts?

      Example use cases... Right now a typical hadoop install runs serval services that use databases (Oozie, HCat, Hive ...). What if some of these could be ported to use HDFS itself as their store with the HBase API provided to manage their data.

      Attachments

        Activity

          People

            Unassigned Unassigned
            eric14 Eric Baldeschwieler
            Votes:
            0 Vote for this issue
            Watchers:
            16 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: