Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-11765

ReplicationSink should merge the Put/Delete of the same row into one Action even if they are from different hlog entry.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Abandoned
    • 0.94.7
    • None
    • Performance, Replication
    • None

    Description

      The current replicationSink code make sure it will only create one Put/Delete action of the kv of same row if it is from same hlog entry. However, when the same row of Put/Delete exist in different hlog entry, multiple Put/Delete action will be created, this will cause synchronization cost during the multi batch operation.

      In one of our application traffic pattern which has delete for same row twice for many rows, we saw doMiniBatchMutation() is invoked many times due to the row lock for the same row. ReplicationSink side is super slow, and replication queue build up.

      We should put the put/delete for the same row into one Put/Delete action even if they are from different hlog entry.

      Attachments

        1. HBASE-11765.patch
          4 kB
          Tianying Chang

        Activity

          People

            Unassigned Unassigned
            tychang Tianying Chang
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: