Hi all, you can find in attachment a new patch including the support for -keyPrefix command line option. This is an optional flag that enables to add a prefix to every key value in the output format.
Moreover, in this patch, the CommonCrawlDataDumper tool uses the method in DumpFileUtil (NUTCH-1968) as suggested by Chris A. Mattmann.
- incorporates
-
NUTCH-1959 Improving CommonCrawlFormat implementations
-
- Resolved
-