Details
- Task
- Status: Closed
- Major
- Resolution: Fixed
- 0.8
- None
- None
Description
The NDFS and MapReduce code should move from Nutch to a new Lucene sub-project named Hadoop.
My plan is to do this as follows:
1. Move all code in the following packages from Nutch to Hadoop:
org.apache.nutch.fs
org.apache.nutch.io
org.apache.nutch.ipc
org.apache.nutch.mapred
org.apache.nutch.ndfs
These packages will all be renamed into the org.apache.hadoop namespace, and the Nutch code will be updated to reflect this.
2. Move selected classes from Nutch to Hadoop, as follows:
org.apache.nutch.util.NutchConf -> org.apache.hadoop.conf.Configuration
org.apache.nutch.util.NutchConfigurable -> org.apache.hadoop.Configurable
org.apache.nutch.util.NutchConfigured -> org.apache.hadoop.Configured
org.apache.nutch.util.Progress -> org.apache.hadoop.util.Progress
org.apache.nutch.util.LogFormatter -> org.apache.hadoop.util.LogFormatter
org.apache.nutch.util.Daemon -> org.apache.hadoop.util.Daemon
3. Add a jar containing all of the above to Nutch's lib directory.
Does this plan sound reasonable?
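To make the Nutch-side update in steps 1 and 2 concrete, here is a rough sketch of a script that rewrites fully qualified names in Java source text. It is only an illustration, not part of the proposal: the function name `rewrite_names` is hypothetical, and it assumes each package from step 1 keeps its last segment under org.apache.hadoop (for example, org.apache.nutch.fs becoming org.apache.hadoop.fs), which the plan above does not spell out. The class moves are copied from step 2 as written.

```python
import re

# Packages listed in step 1. ASSUMPTION: each keeps its last segment
# under org.apache.hadoop; the plan does not state the exact target names.
MOVED_PACKAGES = ["fs", "io", "ipc", "mapred", "ndfs"]

# Class moves listed in step 2, copied from the plan as written.
MOVED_CLASSES = {
    "org.apache.nutch.util.NutchConf": "org.apache.hadoop.conf.Configuration",
    "org.apache.nutch.util.NutchConfigurable": "org.apache.hadoop.Configurable",
    "org.apache.nutch.util.NutchConfigured": "org.apache.hadoop.Configured",
    "org.apache.nutch.util.Progress": "org.apache.hadoop.util.Progress",
    "org.apache.nutch.util.LogFormatter": "org.apache.hadoop.util.LogFormatter",
    "org.apache.nutch.util.Daemon": "org.apache.hadoop.util.Daemon",
}


def rewrite_names(source: str) -> str:
    """Rewrite fully qualified Nutch names in `source` to their Hadoop names."""
    # Individual class moves. The trailing \b stops NutchConf from
    # matching the first part of NutchConfigurable or NutchConfigured.
    for old, new in MOVED_CLASSES.items():
        source = re.sub(re.escape(old) + r"\b", new, source)
    # Whole-package moves. The trailing \b stops "fs" from matching
    # a longer segment that merely starts with "fs".
    for pkg in MOVED_PACKAGES:
        pattern = r"\borg\.apache\.nutch\." + re.escape(pkg) + r"\b"
        source = re.sub(pattern, "org.apache.hadoop." + pkg, source)
    return source


if __name__ == "__main__":
    print(rewrite_names("import org.apache.nutch.util.NutchConf;"))
```

This only touches fully qualified names such as import lines. Uses of a renamed class by its simple name (NutchConf versus Configuration) would still need to be updated separately.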
Issue Links
- incorporates HADOOP-1 initial import of code from Nutch (Closed)