Issue Details (XML | Word | Printable)

Key: NUTCH-338
Type: Improvement Improvement
Status: Closed Closed
Resolution: Fixed
Priority: Trivial Trivial
Assignee: Chris A. Mattmann
Reporter: Chris A. Mattmann
Votes: 0
Watchers: 0
Operations

If you were logged in you would be able to see more operations.
Nutch

Remove the text parser as an option for parsing PDF files in parse-plugins.xml

Created: 03/Aug/06 03:32 PM   Updated: 24/Sep/06 03:30 PM
Return to search
Component/s: fetcher
Affects Version/s: 0.8
Fix Version/s: 0.8.1, 0.9.0

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works NUTCH-338.Mattmann.patch.txt 2006-08-03 03:34 PM Chris A. Mattmann 0.4 kB
Environment: Mac Book Pro Dual Core Intel 2.1 Ghz, although improvement is independent of environment
Issue Links:
Incorporates
 
Reference
 

Resolution Date: 18/Aug/06 03:11 PM


 Description  « Hide
After some discussion on the mailing list, it was decided that parse-text should not really be an option to parse PDF content. So, this issue includes a trivial patch to remove the parse text plugin from being mapped to PDF content in parse-pugins.xml.

 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Chris A. Mattmann made changes - 03/Aug/06 03:33 PM
Field Original Value New Value
Status Open [ 1 ] In Progress [ 3 ]
Chris A. Mattmann made changes - 03/Aug/06 03:34 PM
Attachment NUTCH-338.Mattmann.patch.txt [ 12338076 ]
Sami Siren made changes - 18/Aug/06 03:11 PM
Resolution Fixed [ 1 ]
Status In Progress [ 3 ] Resolved [ 5 ]
Sami Siren made changes - 19/Aug/06 05:23 AM
Fix Version/s 0.8.1 [ 12312020 ]
Stefan Neufeind made changes - 07/Sep/06 10:49 PM
Link This issue relates to NUTCH-290 [ NUTCH-290 ]
Stefan Neufeind made changes - 07/Sep/06 10:50 PM
Link This issue relates to NUTCH-335 [ NUTCH-335 ]
Stefan Neufeind made changes - 07/Sep/06 10:52 PM
Link This issue is part of NUTCH-362 [ NUTCH-362 ]
Sami Siren made changes - 24/Sep/06 03:30 PM
Status Resolved [ 5 ] Closed [ 6 ]