Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1294

IndexClean job with solr implementation.

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: nutchgora
    • Fix Version/s: 2.3
    • Component/s: None
    • Labels:
      None
    • Patch Info:
      Patch Available

      Description

      I started by copying/altering the trunk version of SolrClean, though is was inadequate for our needs. We needed to mark particular pages as gone even though they still might be visible on the web, this implementation abstracts the index cleaning process, has a Solr implementation, and adds a clean index plugin extension that allows others to tailor how pages might be removed from their store.

        Attachments

        1. NUTCH-1294-v3.patch
          18 kB
          Claudiu Chis
        2. NUTCH-1294-v2.patch
          18 kB
          Lewis John McGibbney
        3. NUTCH-1294.patch
          17 kB
          Dan Rosher

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              rosher Dan Rosher
            • Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: