Nutch / NUTCH-193

move NDFS and MapReduce to a separate project


Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.8
    • Component/s: None
    • Labels: None

    Description

      The NDFS and MapReduce code should move from Nutch to a new Lucene sub-project named Hadoop.

      My plan is to do this as follows:

      1. Move all code in the following packages from Nutch to Hadoop:

      org.apache.nutch.fs
      org.apache.nutch.io
      org.apache.nutch.ipc
      org.apache.nutch.mapred
      org.apache.nutch.ndfs

      These packages will all be moved under org.apache.hadoop (for example, org.apache.nutch.fs becomes org.apache.hadoop.fs), and Nutch code will be updated to use the new names.

      2. Move selected classes from Nutch to Hadoop, as follows:

      org.apache.nutch.util.NutchConf -> org.apache.hadoop.conf.Configuration
      org.apache.nutch.util.NutchConfigurable -> org.apache.hadoop.conf.Configurable
      org.apache.nutch.util.NutchConfigured -> org.apache.hadoop.conf.Configured

      org.apache.nutch.util.Progress -> org.apache.hadoop.util.Progress
      org.apache.nutch.util.LogFormatter -> org.apache.hadoop.util.LogFormatter
      org.apache.nutch.util.Daemon -> org.apache.hadoop.util.Daemon

      3. Add a jar containing all of the above to Nutch's lib directory.
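
To make the effect of steps 1 and 2 on Nutch code concrete, here is a minimal sketch of the name mapping that imports and other fully qualified references would go through. It is illustrative only and not part of the proposed change. The class HadoopRename and its rename method are hypothetical, only a subset of the step 2 class renames is included, and the sketch assumes each moved package keeps its last segment under org.apache.hadoop, which the plan does not spell out for every package. The class names passed in main are examples.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: maps old Nutch names to the proposed Hadoop names.
class HadoopRename {
    // A subset of the explicit class renames listed in step 2.
    static final Map<String, String> CLASS_RENAMES = new LinkedHashMap<>();
    // Packages listed in step 1. Assumption: each keeps its last segment.
    static final String[] MOVED_PACKAGES = {"fs", "io", "ipc", "mapred", "ndfs"};

    static {
        CLASS_RENAMES.put("org.apache.nutch.util.NutchConf",
                          "org.apache.hadoop.conf.Configuration");
        CLASS_RENAMES.put("org.apache.nutch.util.Progress",
                          "org.apache.hadoop.util.Progress");
        CLASS_RENAMES.put("org.apache.nutch.util.Daemon",
                          "org.apache.hadoop.util.Daemon");
    }

    // Returns the new fully qualified name, or the input if it is not moved.
    static String rename(String name) {
        // Explicit class renames take precedence over package prefixes.
        String direct = CLASS_RENAMES.get(name);
        if (direct != null) {
            return direct;
        }
        for (String pkg : MOVED_PACKAGES) {
            String oldPrefix = "org.apache.nutch." + pkg + ".";
            if (name.startsWith(oldPrefix)) {
                return "org.apache.hadoop." + pkg + "."
                        + name.substring(oldPrefix.length());
            }
        }
        return name; // stays in Nutch
    }

    public static void main(String[] args) {
        System.out.println(rename("org.apache.nutch.io.Writable"));
        System.out.println(rename("org.apache.nutch.util.NutchConf"));
        System.out.println(rename("org.apache.nutch.crawl.Crawl"));
    }
}
```

Checking the explicit class table before the package prefixes keeps the step 2 renames, where the class name itself changes, from being shadowed by a broader prefix rule.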

      Does this plan sound reasonable?


          People

            Assignee: Doug Cutting (cutting)
            Reporter: Doug Cutting (cutting)
            Votes: 1
            Watchers: 0
