Description
In ParseOutputFormat, the toUrl is read from each Outlink and then processed as a String: it is filtered, normalized, etc. However, the original Outlink object is what actually gets added, and the normalized URL held in toUrl is never written back to it, so the added Outlink still carries the unnormalized URL.
This issue adds a setUrl method to Outlink, which ParseOutputFormat uses to overwrite the unnormalized URL with the normalized one.
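The write-back described above can be illustrated with a minimal, self-contained sketch. It does not use the real Nutch classes: the nested Outlink here is a simplified stand-in, and the normalize and filter helpers are hypothetical placeholders for Nutch's configurable URL normalizers and filters. The order (normalize, then filter) and the null-means-rejected convention are assumptions made for the example only.

```java
import java.util.ArrayList;
import java.util.List;

public class OutlinkNormalizationSketch {

    /** Simplified stand-in for Nutch's Outlink (assumption: only these fields matter here). */
    public static class Outlink {
        private String toUrl;
        private final String anchor;

        public Outlink(String toUrl, String anchor) {
            this.toUrl = toUrl;
            this.anchor = anchor;
        }

        public String getToUrl() { return toUrl; }
        public String getAnchor() { return anchor; }

        /** The setter this issue proposes: lets callers overwrite the stored URL. */
        public void setUrl(String toUrl) { this.toUrl = toUrl; }
    }

    /** Hypothetical normalizer: trims whitespace and drops any #fragment. */
    public static String normalize(String url) {
        String trimmed = url.trim();
        int hash = trimmed.indexOf('#');
        return hash >= 0 ? trimmed.substring(0, hash) : trimmed;
    }

    /** Hypothetical filter: keeps http(s) URLs, returns null for everything else. */
    public static String filter(String url) {
        return (url.startsWith("http://") || url.startsWith("https://")) ? url : null;
    }

    /** Normalizes and filters each outlink, writing the result back before keeping it. */
    public static List<Outlink> process(List<Outlink> links) {
        List<Outlink> kept = new ArrayList<>();
        for (Outlink link : links) {
            String toUrl = filter(normalize(link.getToUrl()));
            if (toUrl == null) {
                continue; // rejected by the filter, so the outlink is dropped
            }
            link.setUrl(toUrl); // without this line the unnormalized URL would be kept
            kept.add(link);
        }
        return kept;
    }

    public static void main(String[] args) {
        List<Outlink> links = new ArrayList<>();
        links.add(new Outlink("  http://example.com/page#section ", "example"));
        links.add(new Outlink("mailto:someone@example.com", "mail"));
        for (Outlink link : process(links)) {
            System.out.println(link.getToUrl() + " [" + link.getAnchor() + "]");
        }
    }
}
```

Omitting the setUrl call in process reproduces the behaviour reported in this issue: the local toUrl string is cleaned up, but the Outlink that gets added still holds the original value.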
Attachments
Issue Links
- is part of NUTCH-1184 Fetcher to parse and follow Nth degree outlinks (Closed)