Nutch / NUTCH-223

Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN


    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 1.0.0
    • Component/s: None
    • Labels: None
    • Environment: ALL

      Description

      Here is a patch for Crawl.java so that the default value for -topN is the same as in Generator.java.

          *** Crawl.java Fri Mar 3 23:54:28 2006
          --- fix/Crawl.java Sat Mar 4 19:38:27 2006
          ***************
          *** 59,65 ****
                File dir = new File("crawl-" + getDate());
                int threads = job.getInt("fetcher.threads.fetch", 10);
                int depth = 5;
          !     int topN = Integer.MAX_VALUE;

                for (int i = 0; i < args.length; i++) {
                  if ("-dir".equals(args[i])) {
          --- 59,65 ----
                File dir = new File("crawl-" + getDate());
                int threads = job.getInt("fetcher.threads.fetch", 10);
                int depth = 5;
          !     int topN = Long.MAX_VALUE;

                for (int i = 0; i < args.length; i++) {
                  if ("-dir".equals(args[i])) {
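Note that the replacement line in the patch above still declares topN as int. Java does not implicitly narrow a long constant to int, so `int topN = Long.MAX_VALUE;` is rejected by the compiler. The variable would presumably have to be declared long as well. The following is only a minimal sketch of that idea, not the committed fix: the class TopNDefault and the method parseTopN are hypothetical names made up for illustration and are not part of Nutch.

```java
public class TopNDefault {

    // Hypothetical helper (not Nutch code): returns the value given after
    // -topN, or Long.MAX_VALUE ("no limit") when the option is absent.
    static long parseTopN(String[] args) {
        // Declared long: `int topN = Long.MAX_VALUE;` would not compile.
        long topN = Long.MAX_VALUE;
        for (int i = 0; i < args.length; i++) {
            if ("-topN".equals(args[i]) && i + 1 < args.length) {
                topN = Long.parseLong(args[i + 1]);
                i++; // skip the value that was just consumed
            }
        }
        return topN;
    }

    public static void main(String[] args) {
        // No -topN given: falls back to the long default.
        System.out.println(parseTopN(new String[] {}));
        // Explicit -topN value is parsed as a long.
        System.out.println(parseTopN(new String[] {"-topN", "1000"}));
    }
}
```

With a long default, the value can be handed to any code that expects a long limit without a cast, which is the consistency this issue asks for.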


            People

            • Assignee: Andrzej Bialecki (ab)
            • Reporter: Jeff Ritchie (dataspear)
