Description
URLExemptionFilter implementations allow exemptions for external-domain resources by overriding the db.ignore.external.links configuration setting. This is useful when a crawl is focused on a single domain but resources such as images are hosted on a CDN.
Currently, URLExemptionFilters logs the following:
INFO o.a.n.n.URLExemptionFilters LocalJobRunner Map Task Executor #0 Found 0 extensions at point:'org.apache.nutch.net.URLExemptionFilter'
I find this confusing, since the message is logged even when no exemption filter is in use. It would be better to log only if a URLExemptionFilter implementation is actually configured to be used at runtime.
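A minimal sketch of the proposed behaviour, not the actual patch: the message is built only when at least one filter is active, so nothing is logged for the "Found 0 extensions" case. The class ExemptionFilterLogging, the method activeFiltersMessage, and the filter class name org.example.ExampleExemptionFilter are hypothetical names used for illustration and are not part of Nutch.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Optional;

public class ExemptionFilterLogging {

    // Hypothetical helper (not Nutch API): returns a log message only when
    // at least one exemption filter is active, otherwise an empty Optional.
    static Optional<String> activeFiltersMessage(List<String> filterClassNames) {
        if (filterClassNames.isEmpty()) {
            // Nothing configured: stay silent instead of logging "Found 0 ...".
            return Optional.empty();
        }
        return Optional.of("Found " + filterClassNames.size()
                + " active URLExemptionFilter implementation(s): "
                + String.join(", ", filterClassNames));
    }

    public static void main(String[] args) {
        // No filters configured: nothing is printed.
        activeFiltersMessage(Collections.<String>emptyList())
                .ifPresent(System.out::println);

        // One (hypothetical) filter configured: a single line is printed.
        activeFiltersMessage(Arrays.asList("org.example.ExampleExemptionFilter"))
                .ifPresent(System.out::println);
    }
}
```

In the real class the message would go to the existing logger rather than to standard output. Returning an Optional here only keeps the sketch easy to check in isolation.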
I will provide a patch for this.