Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-753

Parallelize HBase scan

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Won't Fix
    • Affects Version/s: Impala 1.0
    • Fix Version/s: None
    • Component/s: Backend
    • Labels:
      None

      Description

      Impala chains all the regions on the same region sever and scan the data using one scan. The lack of parallel scan within the region server makes Impala slower than Hive when scanning HBase table.

      Impala can simply parallelize scanning regions within the same region server.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                alan@cloudera.com Alan Choi
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: