Details
Type: Sub-task
Status: Open
Priority: Minor
Resolution: Unresolved
Description
The Common Crawl dataset contains thousands of hash files generated with the RIPEMD-160 hashing algorithm (http://justsolve.archiveteam.org/wiki/RIPEMD-160). These files are 512 bytes long and have no magic number, but they all end with a .rmd160 extension.
Where it is used, that extension is distinctive enough to stand in for a magic number when identifying these files.
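As a rough illustration of the idea, the check could be sketched as below. This is only a sketch under stated assumptions, not a proposed implementation: the function name detect_rmd160, the media type string application/x-rmd160, the case-insensitive match, and the optional size check are all hypothetical and not part of this issue.

```python
from pathlib import Path

RMD160_EXTENSION = ".rmd160"
EXPECTED_SIZE = 512  # bytes, as stated in the description
# Hypothetical label for illustration only; not a registered media type.
RMD160_MEDIA_TYPE = "application/x-rmd160"


def detect_rmd160(name, size=None):
    """Return the hypothetical media type if `name` ends in .rmd160, else None.

    The files have no magic number, so the extension is the only signal.
    Matching case-insensitively is an assumption. If `size` is given, it is
    used as an extra sanity check against the expected 512 bytes.
    """
    if Path(name).suffix.lower() != RMD160_EXTENSION:
        return None
    if size is not None and size != EXPECTED_SIZE:
        return None
    return RMD160_MEDIA_TYPE
```

For example, detect_rmd160("segment.warc.gz.rmd160", 512) would return the hypothetical type, while a name without the extension, or a size other than 512, would return None.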