Details
Improvement
Status: Open
Major
Resolution: Unresolved
1.6
None
None
Patch Available
Description
At present, the URL partitioning logic is hard-wired inside Nutch core. It should be pluggable, like FetchSchedule, so that a custom implementation can be selected via nutch-site.xml.
There may be use cases where a single domain needs to be partitioned according to some custom logic. The existing UrlPartitioner cannot handle such cases.
Hence the requirement.
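To make the request concrete, here is a rough sketch of what a pluggable partitioner could look like. It is plain Java with no Nutch or Hadoop dependencies, and every name in it (UrlPartitionStrategy, HostPartitionStrategy, PathPrefixPartitionStrategy, PartitionStrategyFactory) is hypothetical and not part of the existing Nutch API. The assumption is that the implementation class name would be read from a property in nutch-site.xml, much as FetchSchedule is chosen through db.fetch.schedule.class, and then loaded by reflection.

```java
import java.net.URI;

/** Hypothetical plug-in point: maps a URL to one of numPartitions buckets. */
interface UrlPartitionStrategy {
    int getPartition(String url, int numPartitions);
}

/** Illustrative default: all URLs of one host land in the same partition. */
final class HostPartitionStrategy implements UrlPartitionStrategy {
    @Override
    public int getPartition(String url, int numPartitions) {
        String host = URI.create(url).getHost();
        // Fall back to the whole URL when no host can be parsed.
        String key = (host == null) ? url : host;
        // floorMod keeps the result non-negative even for negative hash codes.
        return Math.floorMod(key.hashCode(), numPartitions);
    }
}

/** Illustrative custom logic: splits a single host by its first path segment. */
final class PathPrefixPartitionStrategy implements UrlPartitionStrategy {
    @Override
    public int getPartition(String url, int numPartitions) {
        URI uri = URI.create(url);
        String path = (uri.getPath() == null) ? "" : uri.getPath();
        // "/news/a.html" splits into ["", "news", "a.html"], so index 1 is the first segment.
        String[] segments = path.split("/");
        String first = (segments.length > 1) ? segments[1] : "";
        String key = uri.getHost() + "/" + first;
        return Math.floorMod(key.hashCode(), numPartitions);
    }
}

/** Loads a strategy by class name, as a configuration-driven lookup would. */
final class PartitionStrategyFactory {
    private PartitionStrategyFactory() {
    }

    static UrlPartitionStrategy create(String className) {
        try {
            return (UrlPartitionStrategy) Class.forName(className)
                    .getDeclaredConstructor()
                    .newInstance();
        } catch (ReflectiveOperationException e) {
            throw new IllegalArgumentException(
                    "Cannot load partition strategy: " + className, e);
        }
    }
}
```

With something along these lines, the partitioner in core would only need to call PartitionStrategyFactory.create(...) with the configured class name and delegate to getPartition, so a site that has to split one large domain could drop in its own strategy without patching core.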