Description
When a crawl datum contains a WRITABLE_GENERATE_TIME_KEY metadata it will wait forever.
LongWritable oldGenTime = (LongWritable) crawlDatum.getMetaData() .get(Nutch.WRITABLE_GENERATE_TIME_KEY); if (oldGenTime != null) { // awaiting fetch & update if (oldGenTime.get() + genDelay > curTime) // still wait for // update context.getCounter("Generator", "WAIT_FOR_UPDATE").increment(1); return; }
The line
context.getCounter("Generator", "WAIT_FOR_UPDATE").increment(1);
has been introduced in version 1.16 without adding the brackets.
Attachments
Issue Links
- is caused by
-
NUTCH-2737 Generator: count and log reason of rejections during selection
- Closed
- links to