Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1294

IndexClean job with solr implementation.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • nutchgora
    • 2.3
    • None
    • None
    • Patch Available

    Description

      I started by copying/altering the trunk version of SolrClean, though is was inadequate for our needs. We needed to mark particular pages as gone even though they still might be visible on the web, this implementation abstracts the index cleaning process, has a Solr implementation, and adds a clean index plugin extension that allows others to tailor how pages might be removed from their store.

      Attachments

        1. NUTCH-1294-v3.patch
          18 kB
          Claudiu Chis
        2. NUTCH-1294-v2.patch
          18 kB
          Lewis John McGibbney
        3. NUTCH-1294.patch
          17 kB
          Dan Rosher

        Activity

          People

            Unassigned Unassigned
            rosher Dan Rosher
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: