Description
I ran into this issue when merging a crawlDB originating from sitemaps into our normal crawlDB. CrawlDatums are merged based on the output of AbstractFetchSchedule::calculateLastFetchTime(). When a CrawlDatum is unfetched, it has no real last fetch time, so the merge can overwrite fetchTime or other fields.
I assume this is a bug and have a simple fix for it that checks whether the CrawlDatum has status db_unfetched.
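
To illustrate the problem and the kind of check I mean, here is a minimal, self-contained sketch. It does not use the Nutch classes: Datum, Status, merge and both lastFetchTime helpers are hypothetical stand-ins. It also assumes that the last fetch time is derived as fetchTime minus the fetch interval, and that the fix returns 0 for unfetched entries; the actual patch may differ.

```java
import java.util.function.ToLongFunction;

public class MergeSketch {
    // Hypothetical stand-in for the CrawlDatum status values.
    enum Status { DB_UNFETCHED, DB_FETCHED }

    // Hypothetical stand-in for CrawlDatum, reduced to the fields used here.
    static final class Datum {
        final Status status;
        final long fetchTimeMs;     // next scheduled fetch, in milliseconds
        final int fetchIntervalSec; // fetch interval, in seconds

        Datum(Status status, long fetchTimeMs, int fetchIntervalSec) {
            this.status = status;
            this.fetchTimeMs = fetchTimeMs;
            this.fetchIntervalSec = fetchIntervalSec;
        }
    }

    // Assumed computation: next fetch time minus one interval.
    static long naiveLastFetchTime(Datum d) {
        return d.fetchTimeMs - d.fetchIntervalSec * 1000L;
    }

    // Same computation, but an unfetched entry never counts as newer.
    static long guardedLastFetchTime(Datum d) {
        if (d.status == Status.DB_UNFETCHED) {
            return 0L;
        }
        return naiveLastFetchTime(d);
    }

    // Keeps whichever entry reports the later last fetch time.
    static Datum merge(Datum a, Datum b, ToLongFunction<Datum> lastFetch) {
        return lastFetch.applyAsLong(b) > lastFetch.applyAsLong(a) ? b : a;
    }

    public static void main(String[] args) {
        Datum fetched = new Datum(Status.DB_FETCHED, 10_000_000L, 2000);
        Datum fromSitemap = new Datum(Status.DB_UNFETCHED, 9_500_000L, 1000);

        // Without the check, the unfetched sitemap entry wins the merge.
        System.out.println(merge(fetched, fromSitemap, MergeSketch::naiveLastFetchTime).status);
        // With the check, the already fetched entry is kept.
        System.out.println(merge(fetched, fromSitemap, MergeSketch::guardedLastFetchTime).status);
    }
}
```

In this sketch the unfetched entry computes 8,500,000 against 8,000,000 for the fetched one, so it replaces the fetched entry unless the status check is applied.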