Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-501

Medium-scale web crawl with hopcount-based filtering fails to find correct number of documents

    XMLWordPrintableJSON

Details

    Description

      The new web crawler Postgresql load test, which uses hopcount-based filtering, does not discover all 11110 documents it is supposed to. It only discovered 10603 when I ran it just now.

      Attachments

        1. capture.txt
          199 kB
          Karl Wright

        Activity

          People

            kwright@metacarta.com Karl Wright
            kwright@metacarta.com Karl Wright
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: