Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Fixed
-
1.13
-
None
-
None
-
Patch Available
Description
When fetcher.follow.outlinks.depth is non-zero, fetcher follows outlinks. This patch keeps track of already fetched URL's and thus avoid fetching the same URL twice.
A Set is used to keep track of them, hashcodes to reduce memory usage. This is not used if fetcher doesn't follow outlinks.