Description
I observed two issues:
- When using the DefaultFetchSchedule, CrawlDatum's modifiedTime field is not updated on the first successful fetch.
- When a document modification is detected (protocol- or signature-wise), the modifiedTime isn't updated
I can provide a patch later today.
Attachments
Issue Links
- duplicates
-
NUTCH-2164 Inconsistent 'Modified Time' in crawl db
- Closed
- links to