Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-3788

distcp can't copy large files using webhdfs due to missing Content-Length header

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.23.3, 2.0.0-alpha
    • 0.23.3, 2.0.2-alpha
    • webhdfs
    • None
    • Reviewed

    Description

      The following command fails when data1 contains a 3gb file. It passes when using hftp or when the directory just contains smaller (<2gb) files, so looks like a webhdfs issue with large files.

      hadoop distcp webhdfs://eli-thinkpad:50070/user/eli/data1 hdfs://localhost:8020/user/eli/data2

      Attachments

        1. h3788_20120816.patch
          4 kB
          Tsz-wo Sze
        2. h3788_20120815.patch
          4 kB
          Tsz-wo Sze
        3. 20120814NullEntity.patch
          11 kB
          Tsz-wo Sze
        4. h3788_20120814b.patch
          4 kB
          Tsz-wo Sze
        5. h3788_20120814.patch
          4 kB
          Tsz-wo Sze
        6. h3788_20120813.patch
          4 kB
          Tsz-wo Sze
        7. distcp-webhdfs-errors.txt
          12 kB
          Eli Collins

        Issue Links

          Activity

            People

              szetszwo Tsz-wo Sze
              eli Eli Collins
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: