Hadoop Common / HADOOP-845

DFS -get and DFS -cat on a zip file generate different output

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.9.1
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      dfs -get file.gz localfile.gz and dfs -cat file.gz > /tmp/file.gz generate different output. dfs -get generates correct output, but dfs -cat does not.

          Activity

          Hadoop QA added a comment -

          -1, because the patch command could not apply the latest attachment (http://issues.apache.org) as a patch to trunk revision r489707. Please note that this message is automatically generated and may represent a problem with the automation system and not the patch.

          weilei added a comment -

          Some code has been added to org.apache.hadoop.dfs.DFSShell.java so that we can cat gzip files like any other file.

          Hairong Kuang added a comment -

          Hadoop dfs -cat does not handle non-ASCII bytes correctly, so it cannot cat zipped files correctly. Wendy's patch to HADOOP-628 should also be able to fix this bug.
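
To illustrate the kind of corruption described in the comment above, here is a minimal sketch. It assumes, without confirmation from this issue, that the damage comes from passing file contents through a character encoding on the way to stdout. The class and method names are hypothetical and are not taken from DFSShell. It contrasts a raw byte copy with a round trip through US-ASCII text, using the first bytes of a gzip header as input.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class CatBytesSketch {
    // Byte-safe copy: every byte read is written unchanged.
    public static void copyBytes(InputStream in, OutputStream out) throws IOException {
        byte[] buf = new byte[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            out.write(buf, 0, n);
        }
        out.flush();
    }

    // Lossy path: decode the bytes as US-ASCII text, then encode them again.
    // Bytes above 0x7f cannot be represented and are replaced.
    public static byte[] viaAsciiText(byte[] data) {
        String text = new String(data, StandardCharsets.US_ASCII);
        return text.getBytes(StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) throws IOException {
        // Start of a gzip header; 0x8b is not an ASCII byte.
        byte[] gzipHeader = {(byte) 0x1f, (byte) 0x8b, 0x08, 0x00};

        ByteArrayOutputStream out = new ByteArrayOutputStream();
        copyBytes(new ByteArrayInputStream(gzipHeader), out);

        System.out.println("byte copy intact:  " + Arrays.equals(gzipHeader, out.toByteArray()));
        System.out.println("text round trip intact: " + Arrays.equals(gzipHeader, viaAsciiText(gzipHeader)));
    }
}
```

Under this assumption, a byte-oriented copy to the output stream would make -cat and -get produce identical files, since neither would interpret the data as text.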

          Hairong Kuang added a comment -

          Weilei's patch seems to implement zcat. You may refer to the method getRecordReader in org.apache.hadoop.mapred.TextInputFormat.java for a general implementation of zcat.
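
For readers unfamiliar with the distinction drawn above, zcat decompresses a file while printing it, whereas cat should emit the stored bytes unchanged. The following is a rough sketch of the zcat behaviour using only java.util.zip from the JDK. It is not the code from Weilei's patch or from TextInputFormat, and the class and method names are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

public class ZcatSketch {
    // zcat-style output: wrap the raw stream in a gzip decoder and copy
    // the decompressed bytes to the destination.
    public static void zcat(InputStream compressed, OutputStream out) throws IOException {
        try (GZIPInputStream gz = new GZIPInputStream(compressed)) {
            byte[] buf = new byte[4096];
            int n;
            while ((n = gz.read(buf)) != -1) {
                out.write(buf, 0, n);
            }
        }
        out.flush();
    }

    // Helper for the demo only: gzip-compress a byte array in memory.
    public static byte[] gzip(byte[] plain) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(bos)) {
            gz.write(plain);
        }
        return bos.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] compressed = gzip("hello from a gzip file\n".getBytes(StandardCharsets.UTF_8));
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        zcat(new ByteArrayInputStream(compressed), out);
        System.out.print(new String(out.toByteArray(), StandardCharsets.UTF_8));
    }
}
```

A plain cat would skip the GZIPInputStream wrapper and copy the compressed bytes as they are, which is what the reporter expected from dfs -cat.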

          Hairong Kuang added a comment -

          It's a duplicate of HADOOP-628.


            People

            • Assignee: Unassigned
            • Reporter: Mahadev konar
            • Votes: 0
            • Watchers: 0
