Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1558

CharEncodingForConversion in ParseData's ParseMeta, not in ParseData's ContentMeta

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • None
    • 1.8
    • parser
    • None
    • Patch Available

    Description

      This patch from GitHub user ysc fixes two bugs related to character encoding:

      • CharEncodingForConversion in ParseData's ParseMeta, not in ParseData's ContentMeta
      • if http response Header Content-Type return wrong codingļ¼Œthen get coding from the original content of the page

      Information about this pull request is here: http://s.apache.org/VOP

      Attachments

        Activity

          People

            chrismattmann Chris A. Mattmann
            chrismattmann Chris A. Mattmann
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: