HBase / HBASE-12078

Missing Data when scanning using PREFIX_TREE DATA-BLOCK-ENCODING

Details

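    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.98.6.1
    • Fix Version/s: 0.98.7, 0.99.1
    • Component/s: None
    • Labels: None
    • Environment: CentOS 6.3
      hadoop 2.5.0 (hdfs)
      hadoop 2.2.0 (hbase)
      hbase 0.98.6.1
      sun-jdk 1.7.0_67-b01

    • Hadoop Flags: Reviewed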

    Description

      Our row key is composed of two ints. We found that when we scan using only the first int as a prefix, the result is sometimes missing rows. When we dump the whole HFile, however, the missing rows are still there.

      We have written a test case to reproduce the bug. It works like this:

      put 1-12345
      put 12345-0x01000000
      put 12345-0x01010000
      put 12345-0x02000000
      put 12345-0x02020000
      put 12345-0x03000000
      put 12345-0x03030000
      put 12345-0x04000000
      put 12345-0x04040000

      flush memstore

      Then scan using the prefix 12345. The returned row key is 12345-0x02000000 (12345-0x01000000 expected).
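
      To make the expected result concrete, here is a minimal sketch that models the steps above in plain Python, without HBase. It assumes each row key is two big-endian 32-bit ints concatenated (the report only says the key is made of two ints), and it relies on the fact that HBase orders row keys lexicographically by unsigned byte, which is also how Python compares bytes objects. The helper names row_key and prefix_scan are hypothetical and are not part of any HBase API.

```python
import struct


def row_key(first: int, second: int) -> bytes:
    # Assumed encoding: two big-endian unsigned 32-bit ints, concatenated.
    return struct.pack(">II", first, second)


def prefix_scan(sorted_keys, prefix: bytes):
    # Model of a prefix scan: every stored key that starts with the prefix,
    # in stored (sorted) order.
    return [k for k in sorted_keys if k.startswith(prefix)]


# The rows written in the reproduction steps.
second_parts = [
    0x01000000, 0x01010000,
    0x02000000, 0x02020000,
    0x03000000, 0x03030000,
    0x04000000, 0x04040000,
]
rows = [row_key(1, 12345)] + [row_key(12345, s) for s in second_parts]

# Python compares bytes lexicographically by unsigned byte value,
# the same ordering HBase uses for row keys.
store = sorted(rows)

# Scan using only the first int as the prefix.
prefix = struct.pack(">I", 12345)
result = prefix_scan(store, prefix)

first_int, second_int = struct.unpack(">II", result[0])
print(len(result), first_int, hex(second_int))
```

      Under these assumptions a correct scan returns all eight rows whose first int is 12345, and the first of them decodes to (12345, 0x01000000). A scan that begins at a later row has skipped data that is present in the file.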

      Attachments

        1. HBASE-12078-0.98.patch
          11 kB
          Andrew Kyle Purtell
        2. HBASE-12078_1.patch
          11 kB
          Duo Zhang
        3. HBASE-12078.patch
          9 kB
          Duo Zhang
        4. prefix_tree_error.patch
          9 kB
          Duo Zhang

            People

              Assignee: Duo Zhang (zhangduo)
              Reporter: Duo Zhang (zhangduo)
              Votes: 0
              Watchers: 11
