Uploaded image for project: 'ManifoldCF'
  1. ManifoldCF
  2. CONNECTORS-501

Medium-scale web crawl with hopcount-based filtering fails to find correct number of documents

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      The new web crawler Postgresql load test, which uses hopcount-based filtering, does not discover all 11110 documents it is supposed to. It only discovered 10603 when I ran it just now.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            kwright@metacarta.com Karl Wright
            kwright@metacarta.com Karl Wright
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment