Uploaded image for project: 'Hive'
  1. Hive
  2. HIVE-21269

Mandate -update and -delete as DistCp options to sync data files for external tables replication.

    XMLWordPrintableJSON

Details

    Description

      Currently, external tables replication, copies the data in directory level. So, if target directory exist, then DistCp should compare and update or skip data files in the directory instead of creating new directory inside pre-existing target directory.
      This can be achieved using -update.
      Also, -delete option is needed to delete the files missing in source directory but present in target.
      Hive should mandate these DistCp options even if user passes other options.

      Attachments

        1. HIVE-21269.02.patch
          12 kB
          Sankar Hariappan
        2. HIVE-21269.01.patch
          8 kB
          Sankar Hariappan

        Issue Links

          Activity

            People

              sankarh Sankar Hariappan
              sankarh Sankar Hariappan
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 20m
                  20m