Details
-
Bug
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
1.0.0
-
None
-
None
Description
Neko1.9.11 goes into a loop on some documents e.g.
http://mediacet.com/Archive/FourYorkshiremen/bb/post.htm
http://cizel.co.kr/main.php
reverting to 0.9.4 seems to fix the problem
The approach mentioned in https://issues.apache.org/jira/browse/NUTCH-696 could be a way to alleviate similar issues
PS: haven't had time to report to the Neko people yet, will do at some stage