  1. Nutch
  2. NUTCH-564

External parser supports encoding attribute

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.9.0
    • Fix Version/s: nutchgora
    • Component/s: indexer
    • Labels: None
    • Environment: All
    • Patch Info: Patch Available

    Description

      When an external component generates text that is returned to the external parser, the parser always converts it to a String using the platform default character set (os.toString()). For example, the returned text may be UTF-8, in which case it will not be converted to a String correctly unless the default character set is also UTF-8.

      I added the attribute <encoding> to the <implementation> XML in plugin.xml and this is then used to convert the text.
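
      To illustrate the problem and the idea behind the fix, here is a minimal standalone sketch. It is not the code from the attached patches: the class name EncodingSketch and the helper decode are hypothetical, and the fallback to the default character set when no encoding is configured is an assumption made for the example.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingSketch {

    /**
     * Hypothetical helper: decodes the captured output of an external command.
     * If an encoding name is supplied (for example, read from plugin.xml), it is
     * used to decode the bytes; otherwise the platform default character set is
     * used, which is what os.toString() does.
     */
    static String decode(ByteArrayOutputStream os, String encoding) {
        Charset charset = (encoding == null || encoding.isEmpty())
                ? Charset.defaultCharset()
                : Charset.forName(encoding);
        return new String(os.toByteArray(), charset);
    }

    public static void main(String[] args) {
        // Simulate an external command that writes UTF-8 bytes.
        ByteArrayOutputStream os = new ByteArrayOutputStream();
        byte[] utf8 = "caf\u00e9".getBytes(StandardCharsets.UTF_8);
        os.write(utf8, 0, utf8.length);

        // Decoding with the correct encoding recovers the original 4 characters.
        String correct = decode(os, "UTF-8");
        // Decoding with a mismatched single-byte encoding yields 5 characters,
        // because the two-byte UTF-8 sequence for the accented letter is split.
        String wrong = decode(os, "ISO-8859-1");

        System.out.println(correct.length());
        System.out.println(wrong.length());
    }
}
```

      With a configured encoding of UTF-8 the text round-trips intact; with a mismatched character set the accented character is decoded as two separate characters, which is the kind of corruption described above.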

      I have tested the original fix on my local 0.9 installation and attached a patch for it. I have also attached a patch for trunk, which is untested.

      Attachments

        1. ExtParser_0.9.0.patch (2 kB, Antony Bowesman)
        2. ExtParser_1.0.0.patch (2 kB, Antony Bowesman)

          People

            Assignee: Chris A. Mattmann (chrismattmann)
            Reporter: Antony Bowesman (adb)
            Votes: 0
            Watchers: 0
