HIVE-17196 (sub-task of HIVE-14841 Replication - Phase 2)

CM: ReplCopyTask should retain the original file names even if copied from CM path.

Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.1.0
    • Fix Version/s: 3.0.0
    • Component/s: repl
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      Consider the following scenario:
      1. Insert into table T1 with value(X).
      2. Insert into table T1 with value(X) again.
      3. Truncate the table T1.
      – This step backs up 2 files with identical content to cmroot. Because their checksums match, only one file ends up in cmroot.
      4. Run an incremental repl covering the above 3 operations.
      – In this step, the files for both insert events are read from cmroot. The file name in the cm path is the checksum, so both copies get the same name and copying one overwrites the other.
      This leads to data loss, so the original file names must be retained even when copying from the cm path.
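
      The scenario above can be reproduced with a small in-memory model. This is only a sketch, not Hive's ReplCopyTask code: the dictionaries standing in for the table directory and cmroot, the SHA-256 checksum, the helper names, and the file names 000000_0 and 000000_0_copy_1 are all assumptions made for illustration.

```python
import hashlib


def checksum(data: bytes) -> str:
    # Assumption: SHA-256 stands in for whatever checksum cmroot really uses.
    return hashlib.sha256(data).hexdigest()


def insert(table: dict, name: str, data: bytes) -> tuple:
    """Write a file into the table and return an insert event (name, checksum)."""
    table[name] = data
    return (name, checksum(data))


def truncate(table: dict, cmroot: dict) -> None:
    """Back up every table file to cmroot under its checksum, then empty the table."""
    for data in table.values():
        cmroot[checksum(data)] = data  # identical content collapses into one entry
    table.clear()


def replicate(events: list, cmroot: dict, keep_original_names: bool) -> dict:
    """Copy the file of each insert event from cmroot into a target directory."""
    target = {}
    for original_name, digest in events:
        # Naming the copy after the cm file (the checksum) makes duplicates collide.
        dest_name = original_name if keep_original_names else digest
        target[dest_name] = cmroot[digest]
    return target


t1, cmroot = {}, {}
events = [
    insert(t1, "000000_0", b"X\n"),         # step 1
    insert(t1, "000000_0_copy_1", b"X\n"),  # step 2
]
truncate(t1, cmroot)                        # step 3

lossy = replicate(events, cmroot, keep_original_names=False)  # step 4, cm names
fixed = replicate(events, cmroot, keep_original_names=True)   # step 4, original names

print(len(cmroot), len(lossy), len(fixed))  # prints: 1 1 2
```

      With checksum-based names the target ends up with one file for two inserts. Keeping the original names preserves both, even though cmroot stores the content only once.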

      Attachments

        1. HIVE-17196.1.patch
          9 kB
          Daniel Dai
        2. HIVE-17196.2.patch
          11 kB
          Daniel Dai

          People

            Assignee: Daniel Dai (daijy)
            Reporter: Sankar Hariappan (sankarh)
            Votes: 0
            Watchers: 4
