Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2504

Results of maxCountExpr and fetchDelayExpr should be stored in memory in Generate

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • 1.15
    • 1.21
    • generator
    • None

    Description

      With NUTCH-2455 the expressions maxCountExpr and fetchDelayExpr are calculated for each value. That slows the process, instead we can store the results for each host in hostDomainCounts. 

      That will take only 2 x sizeof(long) extra memory per host.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            semyon.semyonov@mail.com Semyon Semyonov

            Dates

              Created:
              Updated:

              Slack

                Issue deployment