Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1690

IndexClean: mark url as unindexed after clean to not delete again

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Auto Closed
    • None
    • 2.5
    • indexer
    • None
    • Patch Available

    Description

      We should marked a deleted page to not delete it again and again. That can simply done by remove Index marker when we delete.
      I also change to delete duplicated url in solrclean.

      Attachments

        1. NUTCH-1690.patch
          5 kB
          Tien Nguyen Manh

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tiennm Tien Nguyen Manh
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: