  1. Hadoop HDFS
  2. HDFS-7316

DistCp cannot handle ":" colon in filename

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Duplicate

    Description

      Similar to HDFS-13. If a DistCp source directory contains a file whose name includes a colon (":"), that file is not copied.

      Example error message:

      java.lang.Exception: java.lang.IllegalArgumentException: Pathname /user/pk1/RECORDS/MasterLink-pk1.gateway2.example.com:22.10:22:30 from hdfs:/access01.mgt.gateway2.example.com:8020/user/pk1/RECORDS/MasterLink-pk1.gateway2.example.com:22.10:22:30 is not a valid DFS filename.
      at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
      at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:522)
      Caused by: java.lang.IllegalArgumentException: Pathname /user/pk1/RECORDS/MasterLink-pxj29.gateway2.example.com:22.10:22:30 from hdfs:/access01.mgt.gateway2.example.com:8020/user/pk1/RECORDS/MasterLink-pk1.gateway2.example.com:22.10:22:30 is not a valid DFS filename.
      at org.apache.hadoop.hdfs.DistributedFileSystem.getPathName(DistributedFileSystem.java:195)
      at org.apache.hadoop.hdfs.DistributedFileSystem.access$000(DistributedFileSystem.java:104)
      at org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1079)
      at org.apache.hadoop.hdfs.DistributedFileSystem$17.doCall(DistributedFileSystem.java:1075)
      at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
      at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1075)
      at org.apache.hadoop.tools.mapred.CopyMapper.map(CopyMapper.java:229)
      at org.apache.hadoop.tools.mapred.CopyMapper.map(CopyMapper.java:50)
      at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:145)
      at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:784)
      at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
      at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:243)
      at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
      at java.util.concurrent.FutureTask.run(FutureTask.java:262)
      at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
      at java.lang.Thread.run(Thread.java:745)
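
      The trace shows the exception raised from DistributedFileSystem.getPathName, which rejects the path as "not a valid DFS filename". The following is a minimal sketch of that kind of name check, assuming the rule is that no path component may contain a colon. The class ColonNameCheck and its isValidName method are hypothetical stand-ins written for illustration; they are not Hadoop's actual code, which may apply additional rules.

```java
// Simplified illustration of a DFS-style pathname check.
// ASSUMPTION: a path is rejected if it is not absolute, or if any
// component is ".", "..", or contains ':'. This is a hypothetical
// stand-in, not the real Hadoop implementation.
class ColonNameCheck {

    static boolean isValidName(String src) {
        // The path must be absolute.
        if (!src.startsWith("/")) {
            return false;
        }
        // Inspect each component between the '/' separators.
        for (String component : src.split("/")) {
            if (component.equals(".")
                    || component.equals("..")
                    || component.contains(":")) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // A name without a colon passes the check.
        System.out.println(isValidName("/user/pk1/RECORDS/plain-name"));
        // The file name from this report contains colons and is rejected.
        System.out.println(isValidName(
            "/user/pk1/RECORDS/MasterLink-pk1.gateway2.example.com:22.10:22:30"));
    }
}
```

      Under these assumptions the first call prints true and the second prints false, which matches the behavior reported here: the file with colons in its name fails validation before any copy is attempted.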

            People

              Assignee: Unassigned
              Reporter: Paul Joslin (prjoslin)
              Votes: 0
              Watchers: 4