Nutch / NUTCH-1441

AnchorIndexingFilter should use plain HashSet


    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.6, 2.1
    • Component/s: None
    • Labels: None
    • Patch Info: Patch Available

      Description

      AnchorIndexingFilter should use a plain HashSet instead of a WeakHashMap. The WeakHashMap is unnecessary and could even cause bugs: a WeakHashMap has its entries removed once the garbage collector notices that the keys are no longer referenced elsewhere.

      This patch also makes the filter a bit faster by lazily instantiating the set, so that no set is created when deduplication is off.
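
      For illustration only, here is a minimal sketch of the two ideas in the description: a plain HashSet for deduplication and lazy creation of that set. This is not the attached patch. The class name AnchorDedupSketch, the method collectAnchors, and the exact-string comparison of anchors are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for the filter logic; not the real AnchorIndexingFilter.
public class AnchorDedupSketch {

    // Returns the anchors that would be indexed, in their original order.
    // When deduplicate is false, no set is ever allocated.
    public static List<String> collectAnchors(String[] anchors, boolean deduplicate) {
        List<String> result = new ArrayList<>();
        Set<String> seen = null; // created lazily, only when deduplication is on

        for (String anchor : anchors) {
            if (deduplicate) {
                if (seen == null) {
                    // A plain HashSet holds strong references, so entries
                    // cannot vanish during a garbage collection.
                    seen = new HashSet<>();
                }
                // Set.add returns false when the element is already present.
                if (!seen.add(anchor)) {
                    continue;
                }
            }
            result.add(anchor);
        }
        return result;
    }

    public static void main(String[] args) {
        String[] anchors = {"Home", "Home", "About"};
        System.out.println(collectAnchors(anchors, true));  // [Home, About]
        System.out.println(collectAnchors(anchors, false)); // [Home, Home, About]
    }
}
```

      With a WeakHashMap in place of the HashSet, an entry whose key is not strongly referenced elsewhere may be dropped by the garbage collector, after which a duplicate anchor could be accepted again.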

        Attachments

        1. NUTCH-1441.patch
          2 kB
          Ferdy
        2. NUTCH-1441-trunk.patch
          2 kB
          Ferdy

            People

            • Assignee: Unassigned
            • Reporter: Ferdy (ferdy.g)
