Details
-
New Feature
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
A DupeDB for Nutch and associated tools to create and read a database containing information on duplicates.
Attachments
Issue Links
- is depended upon by
-
NUTCH-1326 HostDeduplicator for Nutch
- Open
- is related to
-
NUTCH-656 DeleteDuplicates based on crawlDB only
- Closed