Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1441

AnchorIndexingFilter should use plain HashSet

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 1.6, 2.1
    • None
    • None
    • Patch Available

    Description

      AnchorIndexingFilter should use a plain HashSet, instead of WeakHashMap. WeakHashMap is unnecessary and can perhaps even cause bugs. (A WeakHashMap get its entries removed when the gc notices the keys are not elsewhere in use.)

      This patch also makes the filter a bit faster by lazy instantiating the set. (No need to create one everytime when deduplication is off).

      Attachments

        1. NUTCH-1441.patch
          2 kB
          Ferdy
        2. NUTCH-1441-trunk.patch
          2 kB
          Ferdy

        Activity

          People

            Unassigned Unassigned
            ferdy.g Ferdy
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: