Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-223

Crawl.java uses Integer.MAX_VALUE for -topN where Generator.java uses Long.MAX_VALUE for -topN

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Trivial
    • Resolution: Fixed
    • 0.8
    • 1.0.0
    • None
    • None
    • ALL

    Description

      Here is a patch for Crawl.java so that it the default value for -topN is the same as Generator.java

          • Crawl.java Fri Mar 3 23:54:28 2006
          • fix/Crawl.java Sat Mar 4 19:38:27 2006
            ***************
          • 59,65 ****
            File dir = new File("crawl-" + getDate());
            int threads = job.getInt("fetcher.threads.fetch", 10);
            int depth = 5;
            ! int topN = Integer.MAX_VALUE;

      for (int i = 0; i < args.length; i++) {
      if ("-dir".equals(args[i])) {
      — 59,65 ----
      File dir = new File("crawl-" + getDate());
      int threads = job.getInt("fetcher.threads.fetch", 10);
      int depth = 5;
      ! int topN = Long.MAX_VALUE;

      for (int i = 0; i < args.length; i++) {
      if ("-dir".equals(args[i])) {

      Attachments

        Activity

          People

            ab Andrzej Bialecki
            dataspear Jeff Ritchie
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: