Details
Description
We should marked a deleted page to not delete it again and again. That can simply done by remove Index marker when we delete.
I also change to delete duplicated url in solrclean.
Attachments
Attachments
Issue Links
- depends upon
-
NUTCH-1688 Port DeleteDuplicate based on crawlDB only to 2.x
- Closed