  1. Nutch
  2. NUTCH-193

move NDFS and MapReduce to a separate project

    Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.8
    • Component/s: None
    • Labels:
      None

      Description

      The NDFS and MapReduce code should move from Nutch to a new Lucene sub-project named Hadoop.

      My plan is to do this as follows:

      1. Move all code in the following packages from Nutch to Hadoop:

      org.apache.nutch.fs
      org.apache.nutch.io
      org.apache.nutch.ipc
      org.apache.nutch.mapred
      org.apache.nutch.ndfs

      These packages will all be renamed to the corresponding packages under org.apache.hadoop, and Nutch code will be updated to reflect this.

      2. Move selected classes from Nutch to Hadoop, as follows:

      org.apache.nutch.util.NutchConf -> org.apache.hadoop.conf.Configuration
      org.apache.nutch.util.NutchConfigurable -> org.apache.hadoop.Configurable
      org.apache.nutch.util.NutchConfigured -> org.apache.hadoop.Configured

      org.apache.nutch.util.Progress -> org.apache.hadoop.util.Progress
      org.apache.nutch.util.LogFormatter -> org.apache.hadoop.util.LogFormatter
      org.apache.nutch.util.Daemon -> org.apache.hadoop.util.Daemon

      3. Add a jar containing all of the above to Nutch's lib directory.

      Does this plan sound reasonable?
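
The renames in steps 1 and 2 amount to a mechanical rewrite of fully qualified names in the remaining Nutch sources. The following is only a rough sketch of that idea, not the procedure actually used for this issue. The function name `rewrite_qualified_names` is hypothetical, and the package pass assumes each moved package keeps its last segment under org.apache.hadoop, which the description does not spell out. The class mapping is copied from step 2 as written.

```python
import re

# Packages from step 1. Assumption: each keeps its last segment under
# org.apache.hadoop (the issue only says they are renamed to org.apache.hadoop).
MOVED_PACKAGES = ["fs", "io", "ipc", "mapred", "ndfs"]

# Class moves from step 2, copied from the issue text.
MOVED_CLASSES = {
    "org.apache.nutch.util.NutchConf": "org.apache.hadoop.conf.Configuration",
    "org.apache.nutch.util.NutchConfigurable": "org.apache.hadoop.Configurable",
    "org.apache.nutch.util.NutchConfigured": "org.apache.hadoop.Configured",
    "org.apache.nutch.util.Progress": "org.apache.hadoop.util.Progress",
    "org.apache.nutch.util.LogFormatter": "org.apache.hadoop.util.LogFormatter",
    "org.apache.nutch.util.Daemon": "org.apache.hadoop.util.Daemon",
}


def rewrite_qualified_names(source: str) -> str:
    """Rewrite fully qualified Nutch names in Java source text (hypothetical helper)."""
    # Class moves first, longest name first. The trailing \b keeps
    # NutchConf from matching inside NutchConfigurable.
    for old in sorted(MOVED_CLASSES, key=len, reverse=True):
        source = re.sub(re.escape(old) + r"\b", MOVED_CLASSES[old], source)
    # Then whole-package moves. Packages not listed above stay in Nutch.
    for pkg in MOVED_PACKAGES:
        pattern = r"\borg\.apache\.nutch\." + re.escape(pkg) + r"\b"
        source = re.sub(pattern, "org.apache.hadoop." + pkg, source)
    return source


if __name__ == "__main__":
    sample = (
        "import org.apache.nutch.mapred.JobConf;\n"
        "import org.apache.nutch.util.NutchConf;\n"
        "import org.apache.nutch.parse.Parser;\n"
    )
    print(rewrite_qualified_names(sample))
```

This only touches fully qualified names such as import lines. Uses of a simple name whose class is renamed (for example NutchConf becoming Configuration) would still need to be updated separately.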

              People

              • Assignee: Doug Cutting
              • Reporter: Doug Cutting
              • Votes: 1
              • Watchers: 0
