Apache Gora / GORA-117

gora-hbase does not have a mechanism to set the caching on a scanner, which makes for poor performance on map/reduce jobs

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.2
    • Fix Version/s: 0.4
    • Component/s: gora-hbase
    • Labels:
      None

      Description

      goraci runs a map/reduce job over all the data that it generates. The HBase store uses a scanner that doesn't cache rows, which means every row fetched requires its own RPC call. I experimented with

      scan.setCaching(1000);

      and goraci Verify ran about 30x faster.
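
To illustrate why the setting matters, here is a minimal sketch in plain Java that counts round trips for a full scan at different caching values. It is a toy model, not the HBase client: `ScanCachingModel`, `FakeServer`, and `rpcsForFullScan` are hypothetical names invented for this example, and the model ignores real-world details such as the extra call a client makes to detect the end of a scan. It shows only that the number of round trips falls roughly in proportion to the caching value; it does not reproduce the 30x figure measured above.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model of a row scanner; NOT the HBase client, only an illustration. */
final class ScanCachingModel {

    /** Hypothetical stand-in for a server that counts simulated round trips. */
    static final class FakeServer {
        private final int totalRows;
        private int nextRow = 0;
        int rpcCalls = 0;

        FakeServer(int totalRows) {
            this.totalRows = totalRows;
        }

        /** One simulated RPC: returns up to `caching` row ids. */
        List<Integer> next(int caching) {
            rpcCalls++;
            List<Integer> batch = new ArrayList<>();
            while (batch.size() < caching && nextRow < totalRows) {
                batch.add(nextRow++);
            }
            return batch;
        }
    }

    /** Scans every row and returns the number of simulated RPCs it took. */
    static int rpcsForFullScan(int totalRows, int caching) {
        if (caching < 1) {
            throw new IllegalArgumentException("caching must be at least 1");
        }
        FakeServer server = new FakeServer(totalRows);
        int rowsSeen = 0;
        while (rowsSeen < totalRows) {
            rowsSeen += server.next(caching).size();
        }
        return server.rpcCalls;
    }

    public static void main(String[] args) {
        int rows = 100_000;
        System.out.println("caching=1:    " + rpcsForFullScan(rows, 1) + " RPCs");
        System.out.println("caching=1000: " + rpcsForFullScan(rows, 1000) + " RPCs");
    }
}
```

In this model, a caching value of 1 costs one round trip per row, while a value of 1000 costs one round trip per thousand rows. Larger values also mean more rows held in client memory per fetch, which is one reason the value is better exposed as a configurable setting than hard-coded.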

      1. GORA-117.patch
        4 kB
        Alfonso Nishikawa

          People

          • Assignee: stack
          • Reporter: Eric Newton
          • Votes: 1
          • Watchers: 7
