Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1657

ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION never set in HTMLParser

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 2.2.1
    • 2.3
    • None
    • None
    • Patch Available

    Description

      ORIGINAL_CHAR_ENCODING and CHAR_ENCODING_FOR_CONVERSION are never set in HTMLParser.java.
      In 2.x, we didn't set this value any field. Actually we never use this value in 2.x I thought delete them. But Feng Lu guided me and I will set metadata field.

      Attachments

        1. NUTCH-1657.patch
          1 kB
          Talat Uyarer

        Issue Links

          Activity

            People

              talat Talat Uyarer
              talat Talat Uyarer
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: