Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-3054

distcp -skipcrccheck has no effect

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.23.2, 2.0.0-alpha, 2.0.1-alpha, 2.0.2-alpha
    • 2.0.2-alpha
    • tools
    • None

    Description

      Using distcp with '-skipcrccheck' still seems to cause CRC checksums to happen.

      Ran into this while debugging an issue associated with source and destination having different blocksizes, and not using the preserve blocksize parameter (-pb). In both 23.1 and 23.2 builds, trying to bypass the checksum verification by using the '-skipcrcrcheck' parameter had no effect, the distcp still failed on checksum errors.

      Test scenario to reproduce;
      do not use '-pb' and try a distcp from 20.205 (default blksize=128M) to .23 (default blksize=256M), the distcp fails on checksum errors, which is expected due to checksum calculation (tiered aggregation of all blks). Trying the same distcp only providing '-skipcrccheck' still fails with the same checksum error, it is expected that checksum would now be bypassed and the distcp would proceed.

      Attachments

        1. hdfs-3054.patch
          3 kB
          Rahul Jain
        2. HDFS-3054.002.patch
          3 kB
          Colin McCabe
        3. HDFS-3054.004.patch
          3 kB
          Colin McCabe

        Issue Links

          Activity

            People

              cmccabe Colin McCabe
              patwhitey2007 patrick white
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: