  1. Nutch
  2. NUTCH-2420

Bug in variable generate.max.count and fetcher.server.delay

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: 1.14
    • Component/s: generator
    • Labels:
      None

      Description

      The feature added by NUTCH-2368 does not work for multiple hosts. Once a HostDatum has been read by getHostDatum(), the next host cannot be read. Apparently I need to open and close the SequenceFile.Readers for every HostDatum that is needed; the Reader has no reset() method or anything similar.
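
      The failure mode described above can be illustrated without Hadoop. The following is a minimal sketch, not the Nutch code or the attached patch: ForwardReader is a hypothetical stand-in for a reader that can only move forward, and the class name, method names, host names, and values are all made up for illustration. It shows why a second lookup on a shared reader can miss a host, and how opening a fresh reader per lookup avoids that.

```java
import java.util.List;
import java.util.Map;
import java.util.function.Supplier;

public class ForwardOnlyLookup {

    /** Hypothetical stand-in for a forward-only reader: it can advance but never rewind. */
    static final class ForwardReader {
        private final List<Map.Entry<String, Integer>> records;
        private int position = 0;

        ForwardReader(List<Map.Entry<String, Integer>> records) {
            this.records = records;
        }

        /** Returns the next record, or null once the end has been reached. */
        Map.Entry<String, Integer> next() {
            return position < records.size() ? records.get(position++) : null;
        }
    }

    /** Scans from the reader's current position; records already consumed are never seen again. */
    static Integer scan(ForwardReader reader, String host) {
        Map.Entry<String, Integer> record;
        while ((record = reader.next()) != null) {
            if (record.getKey().equals(host)) {
                return record.getValue();
            }
        }
        return null;
    }

    /** Workaround: open a fresh reader for every lookup so each scan starts at the first record. */
    static Integer lookupWithFreshReader(Supplier<ForwardReader> opener, String host) {
        return scan(opener.get(), host);
    }

    public static void main(String[] args) {
        // Made-up host records; the integer stands in for whatever per-host value is stored.
        List<Map.Entry<String, Integer>> hostDb = List.of(
            Map.entry("a.example", 10),
            Map.entry("b.example", 20),
            Map.entry("c.example", 30));

        // Shared reader: the first lookup consumes every record up to the match.
        ForwardReader shared = new ForwardReader(hostDb);
        System.out.println(scan(shared, "c.example")); // 30
        System.out.println(scan(shared, "a.example")); // null, the reader is already past it

        // Fresh reader per lookup: every host can be found, at the cost of a rescan.
        Supplier<ForwardReader> opener = () -> new ForwardReader(hostDb);
        System.out.println(lookupWithFreshReader(opener, "c.example")); // 30
        System.out.println(lookupWithFreshReader(opener, "a.example")); // 10
    }
}
```

      In this sketch the reopen-per-lookup approach trades a full rescan on each call for correctness, which matches the "ugly but works" character of the workaround.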

      1. NUTCH-2420.patch
        2 kB
        Markus Jelsma

        Activity

        markus17 Markus Jelsma added a comment -

        Patch for master! This calls open and reset each time a HostDatum is read. Although ugly, I've seen it work on our production system.

          People

          • Assignee:
            markus17 Markus Jelsma
            Reporter:
            markus17 Markus Jelsma
          • Votes:
            0
            Watchers:
            1
