Description
Hash based duplicate document detection is efficient and allows for blocking as well as field collapsing. Lets put it into solr.
Hash based duplicate document detection is efficient and allows for blocking as well as field collapsing. Lets put it into solr.