Description
I'm testing Apache Nutch with the feed's plugin. I've noticed that for each page it generates the same digest/signature, therefore the dedup cleans everything up from the database.
I'm wondering why the class MD5Signature is the default one instead of TextMD5Signature.
Anyhow now I've modified a little bit the MD5Signature to let it work with the feed plugin