Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-20844

Duplicate rows returned while hbase snapshot reads

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 1.3.1
    • Fix Version/s: None
    • Component/s: mapreduce, snapshots, spark
    • Labels:
      None
    • Environment:

      Cluster Details

      Java 1.7
      Hbase 1.3.1
      Spark 1.6.1

      Description

      We are trying to take snapshot from code and read data using MR and spark, both approaches are returning duplicate records.

      On the API side, {{org.apache.hadoop.hbase.mapreduce.TableSnapshotInputFormat }} is used.

      Snapshot was taken during the table was in a region split state.

      We suspect it is due to data is being returned for both parent and daughter regions.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                shivakumar.ss ShivaKumar SS
              • Votes:
                0 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: