  1. Hadoop HDFS
  2. HDFS-17498

DistCp fails on files generated by concat (RPC method) because the sourceFS checksum does not equal the targetFS checksum.


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.3.4
    • Fix Version/s: None
    • Component/s: distcp
    • Labels: None

    Description

      When we use distcp, the sourceFS's checksum and targetFS's checksum are checked for consistency after the file transfer is complete. 

      However, some files on the source side are produced by ClientProtocol's concat (an RPC method), and their block sizes are less than 128 MB. For example, the sourceFS file consists of 10 MB + 10 MB blocks, while the targetFS file is a single 20 MB block. The checksums of the source and destination sides are therefore inconsistent, which causes distcp to fail.
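The mismatch described above can be illustrated without a cluster. The sketch below is a deliberately simplified model, not the actual HDFS implementation: it assumes the file checksum is an MD5 over the per-block MD5 digests, and the helper names `block_md5` and `layout_checksum` are hypothetical. It only shows that identical bytes split into different block layouts yield different layout-dependent checksums.

```shell
#!/usr/bin/env bash
# Simplified model (an assumption for illustration, not HDFS code):
# file checksum = MD5 over the concatenated per-block MD5 digests.

# Hex MD5 digest of a single "block" passed as a string.
block_md5() { printf '%s' "$1" | md5sum | cut -d' ' -f1; }

# layout_checksum BLOCK... : digest over the per-block digests, in order.
layout_checksum() {
  local digests=""
  local block
  for block in "$@"; do
    digests+="$(block_md5 "$block")"
  done
  printf '%s' "$digests" | md5sum | cut -d' ' -f1
}

part_a="hello "
part_b="world"

# Source side: two small blocks, as left behind by concat.
src_sum="$(layout_checksum "$part_a" "$part_b")"
# Target side: the same bytes written as one block by the copy.
dst_sum="$(layout_checksum "$part_a$part_b")"

echo "source layout checksum: $src_sum"
echo "target layout checksum: $dst_sum"
```

The two printed digests differ even though the underlying bytes are identical, which is the same kind of disagreement the post-copy check reports.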

       

      Case:

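      # Upload the same small file three times; each copy is a single small block
      hadoop fs -put /etc/hosts /tmp/a.txt
      hadoop fs -put /etc/hosts /tmp/b.txt
      hadoop fs -put /etc/hosts /tmp/c.txt
      # Append the blocks of b.txt and c.txt to a.txt
      hadoop fs -concat /tmp/a.txt /tmp/b.txt /tmp/c.txt
      # Show the block layout of the concatenated file
      hdfs fsck /tmp/a.txt -files -blocks -locations | grep blk_
      # Copy to the target cluster; the checksum comparison after the copy fails
      hadoop distcp /tmp/a.txt hdfs://kde-sts-0.com/tmp/res.txt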
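A possible way to investigate the case above is to repeat the copy with the block-layout-independent composite CRC. The sketch below is not taken from the issue: it assumes both clusters support the `dfs.checksum.combine.mode=COMPOSITE_CRC` client setting, and it reuses the paths from the case. Commands are built as arrays so they can be inspected on a machine without a Hadoop client.

```shell
#!/usr/bin/env bash
# Sketch under assumptions: both clusters must support composite CRC checksums.
SRC=/tmp/a.txt
DST=hdfs://kde-sts-0.com/tmp/res.txt
MODE_OPT=-Ddfs.checksum.combine.mode=COMPOSITE_CRC

# Copy with the composite CRC, then print the checksum of each side.
distcp_cmd=(hadoop distcp "$MODE_OPT" "$SRC" "$DST")
src_sum_cmd=(hadoop fs "$MODE_OPT" -checksum "$SRC")
dst_sum_cmd=(hadoop fs "$MODE_OPT" -checksum "$DST")

if command -v hadoop >/dev/null 2>&1; then
  "${distcp_cmd[@]}" && "${src_sum_cmd[@]}" && "${dst_sum_cmd[@]}"
else
  # No Hadoop client on this host: only show what would be run.
  echo "hadoop client not found; would run: ${distcp_cmd[*]}"
fi
```

If the two `-checksum` outputs match in this mode, the failure is likely due to the block layout rather than to corrupted data.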


      Attachments

        1. image-2024-04-24-15-54-16-253.png
          114 kB
          xiaojunxiang
        2. image-2024-04-24-15-54-58-047.png
          85 kB
          xiaojunxiang
        3. image-2024-04-24-15-55-25-519.png
          100 kB
          xiaojunxiang
        4. image-2024-04-24-15-55-48-752.png
          58 kB
          xiaojunxiang
        5. image-2024-04-26-10-47-21-884.png
          212 kB
          xiaojunxiang
        6. image-2024-04-26-10-47-29-220.png
          14 kB
          xiaojunxiang
        7. image-2024-04-26-10-47-49-294.png
          237 kB
          xiaojunxiang
        8. image-2024-04-26-10-50-12-238.png
          212 kB
          xiaojunxiang
        9. image-2024-04-26-10-50-18-788.png
          14 kB
          xiaojunxiang
        10. image-2024-04-26-10-50-23-340.png
          237 kB
          xiaojunxiang

        Activity

          People

            Assignee: Unassigned
            Reporter: xiaojunxiang (bigdata_zoodev)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated: