Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-21213

Acid table bootstrap replication needs to handle directory created by compaction with txn id

Log workAgile BoardRank to TopRank to BottomBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      The current implementation of compaction uses the txn id in the directory name. This is used to isolate the queries from reading the directory until compaction has finished and to avoid the compactor marking used earlier. In case of replication, during bootstrap , directory is copied as it is with the same name from source to destination cluster. But the directory created by compaction with txn id can not be copied as the txn list at target may be different from source. The txn id which is valid at source may be an aborted txn at target. So conversion logic is required to create a new directory with valid txn at target and dump the data to the newly created directory.

      Attachments

        1. HIVE-21213.01.patch
          10 kB
          mahesh kumar behera
        2. HIVE-21213.02.patch
          13 kB
          mahesh kumar behera
        3. HIVE-21213.03.patch
          13 kB
          mahesh kumar behera
        4. HIVE-21213.04.patch
          18 kB
          mahesh kumar behera
        5. HIVE-21213.05.patch
          18 kB
          mahesh kumar behera

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            maheshk114 mahesh kumar behera Assign to me
            maheshk114 mahesh kumar behera
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

              Estimated:
              Original Estimate - Not Specified
              Not Specified
              Remaining:
              Remaining Estimate - 0h
              0h
              Logged:
              Time Spent - 2.5h
              2.5h

              Slack

                Issue deployment