Description
NUTCH-1532 needs to obtain a batchId to add to NutchDocument prior to indexing. This is currently not available as we do not store the information in the WebPage. Additionally, we do not store the other ModifiedTime's but incorrectly set them in o.a.n.crawl.FetchSchedule#setFetchSchedule.
All the above accessors should be implemented.
Attachments
Attachments
Issue Links
- blocks
-
NUTCH-1532 Replace 'segment' mapping field with batchId
- Closed