NUTCH-193: move NDFS and MapReduce to a separate project

Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.8
    • Component/s: None
    • Labels: None

    Description

      The NDFS and MapReduce code should move from Nutch to a new Lucene sub-project named Hadoop.

      My plan is to do this as follows:

      1. Move all code in the following packages from Nutch to Hadoop:

      org.apache.nutch.fs
      org.apache.nutch.io
      org.apache.nutch.ipc
      org.apache.nutch.mapred
      org.apache.nutch.ndfs

      These packages will all be renamed to use the org.apache.hadoop prefix, and the Nutch code will be updated to reflect this.

      2. Move selected classes from Nutch to Hadoop, as follows:

      org.apache.nutch.util.NutchConf -> org.apache.hadoop.conf.Configuration
      org.apache.nutch.util.NutchConfigurable -> org.apache.hadoop.Configurable
      org.apache.nutch.util.NutchConfigured -> org.apache.hadoop.Configured

      org.apache.nutch.util.Progress -> org.apache.hadoop.util.Progress
      org.apache.nutch.util.LogFormatter -> org.apache.hadoop.util.LogFormatter
      org.apache.nutch.util.Daemon -> org.apache.hadoop.util.Daemon

      3. Add a jar containing all of the above to Nutch's lib directory.

      Does this plan sound reasonable?
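
The renames in steps 1 and 2 could be sketched as a lookup over fully qualified class names. This is only an illustration, not the tooling actually used for the move: the class `HadoopRename` and its `rename` method are hypothetical, and the sketch assumes each moved package keeps its last name segment under the new prefix (for example `org.apache.nutch.io` becoming `org.apache.hadoop.io`), which the plan does not spell out. Class renames are matched exactly rather than by substring, because `NutchConf` is a prefix of `NutchConfigurable` and `NutchConfigured`.

```java
import java.util.List;
import java.util.Map;

// Hypothetical helper illustrating the rename plan; not part of Nutch or Hadoop.
class HadoopRename {
    // Step 1: packages moved wholesale.
    // ASSUMPTION: the last segment of each package name is kept unchanged.
    static final List<String> MOVED_PACKAGES =
        List.of("fs", "io", "ipc", "mapred", "ndfs");

    // Step 2: individually moved classes, exactly as listed in the plan.
    static final Map<String, String> MOVED_CLASSES = Map.of(
        "org.apache.nutch.util.NutchConf", "org.apache.hadoop.conf.Configuration",
        "org.apache.nutch.util.NutchConfigurable", "org.apache.hadoop.Configurable",
        "org.apache.nutch.util.NutchConfigured", "org.apache.hadoop.Configured",
        "org.apache.nutch.util.Progress", "org.apache.hadoop.util.Progress",
        "org.apache.nutch.util.LogFormatter", "org.apache.hadoop.util.LogFormatter",
        "org.apache.nutch.util.Daemon", "org.apache.hadoop.util.Daemon");

    // Maps an old fully qualified class name to its new name,
    // or returns it unchanged if the class stays in Nutch.
    static String rename(String name) {
        // Exact match first, so NutchConf never clobbers NutchConfigurable.
        String moved = MOVED_CLASSES.get(name);
        if (moved != null) {
            return moved;
        }
        for (String pkg : MOVED_PACKAGES) {
            String oldPrefix = "org.apache.nutch." + pkg + ".";
            if (name.startsWith(oldPrefix)) {
                return "org.apache.hadoop." + pkg + "."
                    + name.substring(oldPrefix.length());
            }
        }
        return name;
    }

    public static void main(String[] args) {
        // The class names below are illustrative inputs only.
        System.out.println(rename("org.apache.nutch.util.NutchConf"));
        System.out.println(rename("org.apache.nutch.io.Writable"));
        System.out.println(rename("org.apache.nutch.parse.Parser"));
    }
}
```

Under these assumptions, a name in one of the five moved packages only changes its prefix, one of the six listed classes maps to its new name, and anything else is returned untouched and so remains in Nutch.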

            People

              Assignee: Doug Cutting (cutting)
              Reporter: Doug Cutting (cutting)
              Votes: 1
              Watchers: 0

              Dates

                Created:
                Updated:
                Resolved: