Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2574

Generator: hostCount >= maxCount comparison wrong

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.13
    • 1.15
    • generator
    • None

    Description

      In the Generator.Selector.reduce function, there is a comparison of hostCount[1] to maxCount, to determine whether or not to push the current URL to the next segment. The purpose is to honor generate.max.count.

      Sebastian noticed that it should test if (hostCount[1] > maxCount) rather than ">=".  As it stands, the code sometimes puts one less url into a segment than it should.

      Attachments

        Issue Links

          Activity

            People

              snagel Sebastian Nagel
              xoffey@gmail.com Michael Coffey
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: