Description
The follow outlinks feature already respects the db.ignore.external.links setting. However, this means that outlinks of fetched pages that are external are not saved in parse data. There should be a new setting to prevent the outlink follower from going external but still storing external outlinks.
Attachments
Attachments
Issue Links
- is part of
-
NUTCH-1184 Fetcher to parse and follow Nth degree outlinks
- Closed