Description
Some parsers have no Outlinks returned. E.g. the Word-Parser.
This class is able to extract (absolute) hyperlinks from a plain String (content) and generates outlinks from them.
This would be very usful for parser which have no explicite extraction of hyperlinks.
Excample:
Outlink[] links = OutlinkExtractor.getOutlinks("Nutch is located at http://www.apache.org and ...");
Will return an array of Outlinks containing the one element of "http://www.apache.org".
transfered from: http://sourceforge.net/tracker/index.php?func=detail&aid=1109328&group_id=59548&atid=491356
submitted by: Stephan Strittmatter