Description
When generating segments with topN and maxNumSegments, topN is not respected. It looks like the first generated segment contains topN * maxNumSegments of URLs's, at least the number of map input records roughly matches.
Attachments
Attachments
Issue Links
- is related to
-
NUTCH-762 Alternative Generator which can generate several segments in one parse of the crawlDB
- Closed