Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-338

Remove the text parser as an option for parsing PDF files in parse-plugins.xml

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Trivial
    • Resolution: Fixed
    • 0.8
    • 0.8.1, 0.9.0
    • fetcher
    • None
    • Mac Book Pro Dual Core Intel 2.1 Ghz, although improvement is independent of environment

    Description

      After some discussion on the mailing list, it was decided that parse-text should not really be an option to parse PDF content. So, this issue includes a trivial patch to remove the parse text plugin from being mapped to PDF content in parse-pugins.xml.

      Attachments

        1. NUTCH-338.Mattmann.patch.txt
          0.4 kB
          Chris A. Mattmann

        Issue Links

          Activity

            People

              chrismattmann Chris A. Mattmann
              chrismattmann Chris A. Mattmann
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: