
Key: NUTCH-442
Type: New Feature
Status: Closed
Resolution: Fixed
Priority: Major
Assignee: Doğacan Güney
Reporter: rubdabadub
Votes: 17
Watchers: 20

Nutch

Integrate Solr/Nutch

Created: 07/Feb/07 06:37 PM   Updated: 10/Apr/09 12:29 PM
Component/s: indexer, searcher
Affects Version/s: None
Fix Version/s: 1.0.0

Time Tracking:
Not Specified

File Attachments (all licensed for inclusion in ASF works):
File                                Type       Date                 Author          Size
Crawl.patch                         Text File  2008-05-12 08:19 PM  Caspar MacRae   3 kB
Indexer.patch                       Text File  2008-05-14 02:04 AM  Caspar MacRae   15 kB
NUTCH-442_v4.patch                  Text File  2007-11-19 09:12 PM  Doğacan Güney   182 kB
NUTCH-442_v5.patch                  Text File  2008-04-17 02:14 PM  Doğacan Güney   178 kB
NUTCH-442_v6.patch.txt              Text File  2008-06-22 03:24 PM  Julien Nioche   181 kB
NUTCH-442_v7.patch.txt              Text File  2008-08-05 07:57 AM  Guillaume Smet  188 kB
NUTCH-442_v7a.patch.txt             Text File  2008-09-19 03:53 PM  Nick Tkach      183 kB
NUTCH-442_v8.patch                  Text File  2008-10-09 12:00 PM  Doğacan Güney   192 kB
NUTCH_442_v3.patch                  Text File  2007-08-06 08:47 AM  Doğacan Güney   196 kB
RFC_multiple_search_backends.patch  Text File  2007-07-31 01:19 PM  Doğacan Güney   158 kB
schema.xml                          XML File   2007-07-31 02:01 PM  Doğacan Güney   2 kB
Environment: Ubuntu Linux

Resolution Date: 12/Jan/09 01:27 PM


Description:
Hi:

I tried out Sami's patch for Solr/Nutch integration (available at http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html) and can confirm that it works. That leads me to request the following:

I would be very grateful if this could be included in Nutch 0.9, as I am trying to eliminate my Python-based crawler, which posts documents to Solr. Because I am in a corporate environment, I can't install the trunk version in production, so I am asking for this to be included in the 0.9 release. I hope my wish will be granted.

I look forward to getting some feedback.

Thank you.



Change History (ascending order):
Doğacan Güney made changes - 31/Jul/07 01:19 PM
Field Original Value New Value
Attachment RFC_multiple_search_backends.patch [ 12362875 ]
Doğacan Güney made changes - 31/Jul/07 02:01 PM
Attachment schema.xml [ 12362879 ]
Doğacan Güney made changes - 06/Aug/07 08:45 AM
Attachment NUTCH_442_v2.patch [ 12363223 ]
Doğacan Güney made changes - 06/Aug/07 08:47 AM
Attachment NUTCH_442_v3.patch [ 12363224 ]
Doğacan Güney made changes - 06/Aug/07 08:48 AM
Attachment NUTCH_442_v2.patch [ 12363223 ]
Doğacan Güney made changes - 19/Nov/07 09:12 PM
Attachment NUTCH-442_v4.patch [ 12369823 ]
Doğacan Güney made changes - 17/Apr/08 02:14 PM
Attachment NUTCH-442_v5.patch [ 12380395 ]
Caspar MacRae made changes - 12/May/08 08:19 PM
Attachment Crawl.patch [ 12381906 ]
Caspar MacRae made changes - 14/May/08 02:04 AM
Attachment Indexer.patch [ 12382008 ]
Julien Nioche made changes - 22/Jun/08 03:24 PM
Attachment NUTCH-442_v6.patch.txt [ 12384452 ]
James Tan made changes - 30/Jul/08 07:51 AM
Comment [ I am facing the same issue that Vladimir Garvardt reported above; please see below. I checked out the latest Nutch revision (680683) from http://svn.apache.org/repos/asf/lucene/nutch/trunk/ and then applied only patch442_v6.patch. Do I need to apply any of the earlier patches to the latest Nutch revision (680683)? Can anybody please advise on this? Thanks in advance!

.....
Indexer: starting
Indexer: crawldb: crawl.test/crawldb
Indexer: linkdb: crawl.test/linkdb
Indexer: solrUrl: http://localhost:8983/solr/
Indexer: adding segment: file:/nutch-solr/nutch-trunk/crawl.test/segments/20080729183600
Exception in thread "main" java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:894)
        at org.apache.nutch.indexer.Indexer.index(Indexer.java:319)
        at org.apache.nutch.crawl.Crawl.main(Crawl.java:148) ]
James Tan made changes - 30/Jul/08 07:52 AM
Comment [ Please disregard comment below. I am able to get it to work now. Thanks ]
Guillaume Smet made changes - 05/Aug/08 07:57 AM
Attachment NUTCH-442_v7.patch.txt [ 12387544 ]
Nick Tkach made changes - 19/Sep/08 03:53 PM
Attachment NUTCH-442_v7a.patch.txt [ 12390519 ]
Doğacan Güney made changes - 09/Oct/08 11:56 AM
Assignee Doğacan Güney [ dogacan ]
Doğacan Güney made changes - 09/Oct/08 11:57 AM
Component/s searcher [ 11593 ]
Component/s indexer [ 11592 ]
Fix Version/s 1.0.0 [ 12312443 ]
Doğacan Güney made changes - 09/Oct/08 12:00 PM
Attachment NUTCH-442_v8.patch [ 12391810 ]
Doğacan Güney made changes - 12/Jan/09 01:27 PM
Resolution Fixed [ 1 ]
Status Open [ 1 ] Resolved [ 5 ]
Sami Siren made changes - 27/Mar/09 08:28 PM
Status Resolved [ 5 ] Closed [ 6 ]