Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-10314

A new tool to sync current HDFS view to specified snapshot

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • None
    • None
    • tools
    • None

    Description

      HDFS-9820 proposed adding -rdiff switch to distcp, as a reversed operation of -diff switch.

      Upon discussion with jingzhao, we will introduce a new tool that wraps around distcp to achieve the same purpose.

      I'm thinking about calling the new tool "rsync", similar to unix/linux command "rsync". The "r" here means remote.

      The syntax that simulate -rdiff behavior proposed in HDFS-9820 is

      rsync <fromSnapshotName>  <toSnapshotName>  <source> <target>
      

      This command ensure <fromSnapshotName> is newer than <toSnapshotName>.

      I think, In the future, we can add another command to have the functionality of -diff switch of distcp.

      sync <fromSnapshotName>  <toSnapshotName>  <source> <target>
      

      that ensures <fromSnapshotName> is older than <toSnapshotName>.

      Thanks jingzhao.

      Attachments

        1. HDFS-10314.001.patch
          95 kB
          Yongjun Zhang

        Issue Links

          Activity

            People

              yzhangal Yongjun Zhang
              yzhangal Yongjun Zhang
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated: