  1. Nutch
  2. NUTCH-2325

Inject REST call to set overwrite and update parameters

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.12
    • Fix Version/s: None
    • Component/s: injector
    • Labels: None

    Description

      The INJECT REST job still uses the old Injector method invocation, which sets overwrite and update to false. As a result, neither option can be enabled through the REST API, which is wrong.

      https://github.com/apache/nutch/blob/master/src/java/org/apache/nutch/crawl/Injector.java#L514

      The REST request should accept options to set overwrite and update.

      For example:

      POST /job/create
      {
          "type": "INJECT",
          "confId": "default",
          "crawlId": "TestCrawl",
          "args": {
              "url_dir": "c:\\cygwin64\\tmp\\1475752235404-0",
              "overwrite": "true",
              "update": "true"
          }
      }
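
      As a rough illustration of the requested behaviour, the sketch below reads the two flags from the "args" map of the request above. InjectArgs and its flag method are hypothetical names used only for this example and are not part of Nutch. The sketch assumes the values may arrive either as JSON strings ("true") or as booleans, and that a missing key should keep the current default of false.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical helper for illustration only; not part of Nutch. */
final class InjectArgs {
    private InjectArgs() {}

    /**
     * Reads a boolean flag from the REST "args" map.
     * Accepts Boolean values or strings such as "true" (case-insensitive).
     * A missing key, a null map, or any other value yields false,
     * which matches the behaviour described in this issue.
     */
    static boolean flag(Map<String, Object> args, String key) {
        if (args == null) {
            return false;
        }
        Object value = args.get(key);
        if (value instanceof Boolean) {
            return (Boolean) value;
        }
        return value != null && Boolean.parseBoolean(value.toString().trim());
    }

    public static void main(String[] argv) {
        // Mirrors the "args" object of the example request.
        Map<String, Object> args = new HashMap<>();
        args.put("url_dir", "c:\\cygwin64\\tmp\\1475752235404-0");
        args.put("overwrite", "true");
        args.put("update", "true");

        boolean overwrite = flag(args, "overwrite");
        boolean update = flag(args, "update");

        // Assumption: a patched Injector would pass these two values on
        // to the inject call instead of hard-coding false for both.
        System.out.println("overwrite=" + overwrite + ", update=" + update);
    }
}
```

      Defaulting to false when a key is absent would keep existing REST clients working unchanged; only requests that explicitly send the new arguments would see different behaviour.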
      

      Attachments

        1. NUTCH-2325_1.patch
          1 kB
          Sujan Kumar Suppala
        2. NUTCH-2325.patch
          1 kB
          Markus Jelsma

          People

            Assignee: Unassigned
            Reporter: Sujan Kumar Suppala (ssuppala)
            Votes: 0
            Watchers: 2

            Dates

              Created:
              Updated: