Description
The Nutch run script (bin/nutch) sets a "conservative" Java heap size of 1000 MB. This default was defined 15 years ago. It is probably safe to increase the heap size to a value better suited to processing more pages or larger documents. What about 4096 MB?
Note that this overlaps with NUTCH-2501 (Java heap size defined via mapred.child.java.opts in distributed mode).
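
For local mode, the proposed change could look roughly like the sketch below. It is not the actual bin/nutch code: the helper heap_flag and the variable DEFAULT_HEAP_MB are hypothetical names used for illustration, and it assumes the script lets a NUTCH_HEAPSIZE environment variable (in MB) override the default.

```shell
#!/bin/sh
# Sketch only, not the real bin/nutch logic.
# DEFAULT_HEAP_MB is the value proposed in this issue (the current default is 1000).
DEFAULT_HEAP_MB=4096

# heap_flag [MB]: print the JVM -Xmx option, falling back to the default
# when no size (or an empty size) is given. Hypothetical helper.
heap_flag() {
  echo "-Xmx${1:-$DEFAULT_HEAP_MB}m"
}

# Assumption: NUTCH_HEAPSIZE, if set, overrides the default heap size.
JAVA_HEAP_MAX="$(heap_flag "${NUTCH_HEAPSIZE:-}")"
echo "$JAVA_HEAP_MAX"
```

Under that assumption, users on machines with less memory could still lower the value per run, e.g. by exporting NUTCH_HEAPSIZE=1000 before calling the script.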