Uploaded image for project: 'Hadoop Map/Reduce'
  1. Hadoop Map/Reduce
  2. MAPREDUCE-232

TextInputFormat should support character encoding settings

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None
    • Windows XP SP3

    Description

      I need to read text files in different character encoding from UTF-8,
      but I think TextInputFormat doesn't support such character encoding.

      I suggest the TextInputFormat to support encoding settings like this.
      conf.set("io.file.defaultEncoding", "MS932");

      I will submit a patch candidate.

      Attachments

        1. Hadoop-3481.patch
          4 kB
          NOMURA Yoshihide

        Activity

          People

            Unassigned Unassigned
            yoshimov NOMURA Yoshihide
            Votes:
            1 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated: