Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2327

Seeds injected in REST workflow must be ingested into HDFS

VotersStop watchingWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.12
    • 1.13
    • injector, REST_api
    • None

    Description

      Right now when one uses the REST POST /seed/create API, a directory is created within /var/some/path/here which is create if you are working locally with the Nutch server e.g. on one machine. It is however not suitable for using the REST API in distributed deployments where seeds needs to be present within HDFS. More documentation on this topic is available at
      https://wiki.apache.org/nutch/Nutch_1.X_RESTAPI#Seed_List_creation
      There are also various mailing list threads regarding use of the REST and this injector url issue described above needs to be addressed.

      Sujen Shah CC for context.

      http://www.mail-archive.com/user%40nutch.apache.org/msg14922.html
      http://www.mail-archive.com/user%40nutch.apache.org/msg14921.html

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            sujenshah Sujen Shah
            lewismc Lewis John McGibbney
            Votes:
            0 Vote for this issue
            Watchers:
            4 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment