Nutch / NUTCH-1294

IndexClean job with Solr implementation

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: nutchgora
    • Fix Version/s: 2.3
    • Component/s: None
    • Labels: None
    • Patch Info: Patch Available

      Description

      I started by copying and altering the trunk version of SolrClean, but it was inadequate for our needs: we needed to mark particular pages as gone even though they might still be visible on the web. This implementation abstracts the index cleaning process, provides a Solr implementation, and adds a clean index plugin extension that allows others to tailor how pages are removed from their store.
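
      The separation described above could look roughly like the following. This is a minimal sketch, not code from the attached patches: every name in it (PageRecord, CleanFilter, IndexCleaner, RecordingCleaner, IndexCleanJob) is hypothetical, and the in-memory cleaner stands in for a Solr-backed one so that the example runs without a Solr server.

```java
import java.util.ArrayList;
import java.util.List;

// All names below are hypothetical illustrations, not the classes added by NUTCH-1294.

/** Minimal stand-in for a page held in the crawl store. */
final class PageRecord {
    final String url;
    final int httpStatus;

    PageRecord(String url, int httpStatus) {
        this.url = url;
        this.httpStatus = httpStatus;
    }
}

/** Plugin-style hook: decides whether a page should be removed from the index. */
interface CleanFilter {
    boolean shouldRemove(PageRecord page);
}

/** Backend abstraction; a Solr-backed version would send delete requests here. */
interface IndexCleaner {
    void delete(String url);

    void commit();
}

/** In-memory cleaner, used only so the sketch can run without a search server. */
final class RecordingCleaner implements IndexCleaner {
    final List<String> pending = new ArrayList<>();
    final List<String> deleted = new ArrayList<>();

    @Override
    public void delete(String url) {
        pending.add(url);
    }

    @Override
    public void commit() {
        // Deletions only take effect once committed.
        deleted.addAll(pending);
        pending.clear();
    }
}

/** Backend-agnostic job: asks each filter in turn, deletes on the first match. */
final class IndexCleanJob {
    static int run(Iterable<PageRecord> pages, List<CleanFilter> filters, IndexCleaner cleaner) {
        int removed = 0;
        for (PageRecord page : pages) {
            for (CleanFilter filter : filters) {
                if (filter.shouldRemove(page)) {
                    cleaner.delete(page.url);
                    removed++;
                    break; // one matching filter is enough
                }
            }
        }
        cleaner.commit();
        return removed;
    }
}
```

      With this shape, a site-specific filter (for example, one that matches a URL pattern for retired pages) can mark pages as gone even when they still return HTTP 200, without the job or the backend needing to change.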

      1. NUTCH-1294-v3.patch
        18 kB
        Claudiu Chis
      2. NUTCH-1294-v2.patch
        18 kB
        Lewis John McGibbney
      3. NUTCH-1294.patch
        17 kB
        Dan Rosher

          People

          • Assignee: Unassigned
          • Reporter: Dan Rosher
          • Votes: 0
          • Watchers: 6
