NUTCH-2011 (sub-task of NUTCH-1931: Apache Nutch 1.x REST service and crawler visualization)

Endpoint to support realtime JSON output from the fetcher

Details

    • Type: Sub-task
    • Status: Reopened
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: fetcher, REST_api

    Description

      This fix adds an endpoint to the Nutch REST service that can be queried for a real-time JSON response listing the current and past fetched URLs.

      The endpoint also paginates its output to reduce the bandwidth needed for data transfer in large crawls.
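
      The pagination idea can be illustrated with a minimal sketch. It is not the Nutch implementation: the function name `paginate`, the JSON field names (`page`, `pageSize`, `totalRecords`, `totalPages`, `records`) and the sample fetch records are all hypothetical and chosen only for illustration. The sketch shows how returning one slice of the fetch log per request, plus enough metadata to ask for the next slice, keeps each response small.

```python
import json
from typing import Any


def paginate(records: list[dict[str, Any]], page: int, page_size: int) -> dict[str, Any]:
    """Return one page of records plus the metadata needed to request the next page.

    Pages are 1-indexed. All field names are hypothetical, not Nutch's.
    """
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be >= 1")
    total = len(records)
    total_pages = (total + page_size - 1) // page_size  # ceiling division
    start = (page - 1) * page_size
    return {
        "page": page,
        "pageSize": page_size,
        "totalRecords": total,
        "totalPages": total_pages,
        # Slicing past the end yields an empty list, so out-of-range pages are safe.
        "records": records[start:start + page_size],
    }


# Hypothetical fetch log standing in for the fetcher's current/past URLs.
fetched = [{"url": f"http://example.com/{i}", "status": "fetched"} for i in range(5)]

# Serialize a single page as the JSON body a client would receive.
body = json.dumps(paginate(fetched, page=2, page_size=2))
```

      A client would request successive pages until `page` reaches `totalPages`, so a large crawl never has to be transferred in a single response.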

          People

            Assignee: Chris A. Mattmann (chrismattmann)
            Reporter: Sujen Shah (sujenshah)
            Votes: 0
            Watchers: 6

            Dates

              Created:
              Updated: