HIVE-3603: Enable client-side caching for scans on HBase

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.12.0
    • Component/s: HBase Handler
    • Labels:
      None

      Description

      HBaseHandler sets up a TableInputFormat MapReduce job against HBase to read data. The underlying implementation (in HBaseHandler.java) makes one RPC call per row key, which is very inefficient. We need to specify a client-side cache size on the scan.

      Note that HBase currently supports only row-count-based caching (there is no way to specify a memory limit). HBASE-6770 was created to address this.
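      As a rough illustration of why the cache size matters, the sketch below models the number of fetch round trips a scan needs for a given row-count cache setting. It is a simplified model, not HBaseHandler code: the class and method names are hypothetical, and it ignores the extra calls used to open and close a scanner. In the HBase client, the corresponding setting is, to my knowledge, the row count passed to Scan.setCaching(int).

```java
// Hypothetical helper for illustration only; not part of Hive or HBase.
class ScanCachingModel {

    /**
     * Approximate number of fetch round trips needed to read totalRows rows
     * when each round trip returns up to cachingRows rows.
     * Computed as ceil(totalRows / cachingRows) using integer arithmetic.
     */
    static long roundTrips(long totalRows, int cachingRows) {
        if (totalRows < 0 || cachingRows <= 0) {
            throw new IllegalArgumentException(
                "totalRows must be >= 0 and cachingRows must be > 0");
        }
        return (totalRows + cachingRows - 1) / cachingRows;
    }

    public static void main(String[] args) {
        long rows = 1_000_000L; // assumed table size, chosen for illustration
        // caching = 1 corresponds to the one-RPC-per-row behavior described above.
        for (int caching : new int[] {1, 100, 1000}) {
            System.out.println("caching=" + caching
                + " -> ~" + roundTrips(rows, caching) + " fetch round trips");
        }
    }
}
```

      Because the limit is expressed in rows rather than bytes, a large value combined with wide rows can consume a lot of client memory, which is the gap the memory-limit request above is meant to close.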

              People

              • Assignee: Navis (navis)
              • Reporter: Karthik Ranganathan (karthik.ranga)