  1. HBase
  2. HBASE-22448

Scan is slow for Multiple Column prefixes

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Won't Fix
    • Affects Version/s: 1.4.8, 1.4.9
    • Fix Version/s: None
    • Component/s: Scanners
    • Labels:

      Description

      While scanning a row with around 10 lakh (1 million) columns using 100 column prefixes, the scan takes around 4 seconds in hbase-1.2.5, but the same query takes around 50 seconds in hbase-1.4.9.

      Is there any way to optimise this?

       

      P.S:

      We have applied the patches provided in HBASE-21620 and HBASE-21734. The attached qualifiers.txt file contains the column keys. Use the provided HBaseFileImport.java to populate your table, and use scanquery.txt to run the query.
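
      To make the size of the workload concrete, here is a self-contained Java sketch with no HBase dependency. It is not the HBase scanner code and does not explain the difference between 1.2.5 and 1.4.9. It only models matching about 1 million qualifiers against 100 prefixes, first by checking every prefix in turn and then by a single lookup in a sorted set. The class name, the prefix format (p000_ to p099_), and the generated qualifier names are hypothetical and are not taken from qualifiers.txt or scanquery.txt. The sorted lookup assumes that no prefix is itself a prefix of another.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

public class PrefixMatchSketch {
    // Naive check: compare the qualifier against every prefix in turn.
    static boolean matchesLinear(String qualifier, List<String> prefixes) {
        for (String p : prefixes) {
            if (qualifier.startsWith(p)) {
                return true;
            }
        }
        return false;
    }

    // Sorted check: only the greatest prefix <= qualifier can match,
    // assuming no prefix is itself a prefix of another prefix.
    static boolean matchesSorted(String qualifier, TreeSet<String> sortedPrefixes) {
        String candidate = sortedPrefixes.floor(qualifier);
        return candidate != null && qualifier.startsWith(candidate);
    }

    public static void main(String[] args) {
        // 100 hypothetical prefixes: p000_ ... p099_
        List<String> prefixes = new ArrayList<>();
        for (int i = 0; i < 100; i++) {
            prefixes.add(String.format("p%03d_", i));
        }
        TreeSet<String> sorted = new TreeSet<>(prefixes);

        // 500 hypothetical qualifier families, of which only the first 100 match a prefix.
        String[] families = new String[500];
        for (int i = 0; i < families.length; i++) {
            families[i] = String.format("p%03d_", i);
        }

        int linearHits = 0;
        int sortedHits = 0;
        for (int i = 0; i < 1_000_000; i++) {
            String qualifier = families[i % 500] + "col" + i;
            if (matchesLinear(qualifier, prefixes)) {
                linearHits++;
            }
            if (matchesSorted(qualifier, sorted)) {
                sortedHits++;
            }
        }
        // Both strategies must agree on which qualifiers match.
        System.out.println(linearHits + " " + sortedHits); // prints "200000 200000"
    }
}
```

      The linear check does up to 100 comparisons per qualifier, while the sorted check does one ordered lookup and one comparison. Whether either shape corresponds to the code path exercised by scanquery.txt is not stated in this report.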

        Attachments

        1. HBaseFileImport.java
          2 kB
          Karthick
        2. scanquery.txt
          3 kB
          Karthick
        3. qualifiers.txt
          26.29 MB
          Karthick
        4. 0001-benchmark-UT.patch
          5 kB
          Zheng Hu
        5. org.apache.hadoop.hbase.filter.TestSlowColumnPrefix-output.zip
          915 kB
          ramkrishna.s.vasudevan
        6. filter-list-with-or-internal-2.png
          77 kB
          Zheng Hu

            People

            • Assignee: Zheng Hu (openinx)
            • Reporter: Karthick (KarthickRam)
            • Votes: 1
            • Watchers: 9
