Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-12078

Missing Data when scanning using PREFIX_TREE DATA-BLOCK-ENCODING

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.98.6.1
    • Fix Version/s: 0.98.7, 0.99.1
    • Component/s: None
    • Labels:
      None
    • Environment:

      CentOS 6.3
      hadoop 2.5.0(hdfs)
      hadoop 2.2.0(hbase)
      hbase 0.98.6.1
      sun-jdk 1.7.0_67-b01

    • Hadoop Flags:
      Reviewed

      Description

      our row key is combined with two ints, and we found that sometimes when we using only the first int part to scan, the result returned may missing some rows. But when we dump the whole hfile, the row is still there.

      We have written a testcase to reproduce the bug. It works like this:

      put 1-12345
      put 12345-0x01000000
      put 12345-0x01010000
      put 12345-0x02000000
      put 12345-0x02020000
      put 12345-0x03000000
      put 12345-0x03030000
      put 12345-0x04000000
      put 12345-0x04040000

      flush memstore

      then scan using 12345,the returned row key will be 12345-0x20000000(12345-0x10000000 expected)

        Attachments

        1. HBASE-12078-0.98.patch
          11 kB
          Andrew Kyle Purtell
        2. HBASE-12078_1.patch
          11 kB
          Duo Zhang
        3. HBASE-12078.patch
          9 kB
          Duo Zhang
        4. prefix_tree_error.patch
          9 kB
          Duo Zhang

          Issue Links

            Activity

              People

              • Assignee:
                zhangduo Duo Zhang
                Reporter:
                zhangduo Duo Zhang
              • Votes:
                0 Vote for this issue
                Watchers:
                11 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: