Issue Details (XML | Word | Printable)

Key: NUTCH-338
Type: Improvement Improvement
Status: Closed Closed
Resolution: Fixed
Priority: Trivial Trivial
Assignee: Chris A. Mattmann
Reporter: Chris A. Mattmann
Votes: 0
Watchers: 0
Operations

If you were logged in you would be able to see more operations.
Nutch

Remove the text parser as an option for parsing PDF files in parse-plugins.xml

Created: 03/Aug/06 03:32 PM   Updated: 24/Sep/06 03:30 PM
Return to search
Component/s: fetcher
Affects Version/s: 0.8
Fix Version/s: 0.8.1, 0.9.0

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works NUTCH-338.Mattmann.patch.txt 2006-08-03 03:34 PM Chris A. Mattmann 0.4 kB
Environment: Mac Book Pro Dual Core Intel 2.1 Ghz, although improvement is independent of environment
Issue Links:
Incorporates
 
Reference
 

Resolution Date: 18/Aug/06 03:11 PM


 Description  « Hide
After some discussion on the mailing list, it was decided that parse-text should not really be an option to parse PDF content. So, this issue includes a trivial patch to remove the parse text plugin from being mapped to PDF content in parse-pugins.xml.

 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Chris A. Mattmann added a comment - 03/Aug/06 03:34 PM
simple patch for removing the parse-text plugin from being mapped to PDF content type in parse-plugins.xml.

Chris A. Mattmann added a comment - 18/Aug/06 02:54 PM
Hi Andrzej,

A patch is available that you can apply quickly to remove the text parser as an option for pdf. Could you review this * brief * patch, and then commit? This was the resolution that we decided, no?

Thanks!

Cheers,
Chris


Sami Siren added a comment - 18/Aug/06 03:11 PM
This is now committed, thank you.

The patch was broken, hopefully I got it right.


Chris A. Mattmann added a comment - 18/Aug/06 03:20 PM
Hi Sami,

Thanks much. It's weird that it was broken seeing as it was a one line patch, however, I tried it again in my own environment, and indeed for whatever reason the one line patch was rejected

Last time I use eclipse to generate my one line patch for me

Cheers,
Chris


Sami Siren added a comment - 18/Aug/06 03:25 PM
yeah, svn diff from commandline is the winner.