Hive
  1. Hive
  2. HIVE-3603

Enable client-side caching for scans on HBase

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.12.0
    • Component/s: HBase Handler
    • Labels:
      None

      Description

      HBaseHandler sets up a TableInputFormat MR job against HBase to read data in. The underlying implementation (in HBaseHandler.java) makes an RPC call per row-key, which makes it very inefficient. Need to specify a client side cache size on the scan.

      Note that HBase currently only supports num-rows based caching (no way to specify a memory limit). Created HBASE-6770 to address this.

        Issue Links

          Activity

            People

            • Assignee:
              Navis
              Reporter:
              Karthik Ranganathan
            • Votes:
              0 Vote for this issue
              Watchers:
              14 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development