Nutch / NUTCH-3025

urlfilter-fast to filter based on the length of the URL

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Implemented
    • Affects Version/s: 1.19
    • Fix Version/s: 1.20
    • Component/s: plugin, urlfilter
    • Labels: None

    Description

      There is currently no filter implementation that removes URLs based on their overall length or on the length of their path or query.
      Doing so with the regex filter would be inefficient; instead, we could implement it in _urlfilter-fast_.
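
      As a rough illustration of the idea, the sketch below rejects a URL when its total length, path length, or query length exceeds a configured limit. It is not the code that was committed to urlfilter-fast: the class name UrlLengthFilterSketch, the constructor parameters, and the convention that -1 disables a check are all assumptions made for this example. Returning null for a rejected URL mirrors how Nutch URL filters signal rejection.

```java
import java.net.URI;
import java.net.URISyntaxException;

/** Hypothetical sketch of length-based URL filtering; not the actual urlfilter-fast code. */
final class UrlLengthFilterSketch {
  // Assumed limits for illustration; a negative value disables the check.
  private final int maxUrlLength;
  private final int maxPathLength;
  private final int maxQueryLength;

  UrlLengthFilterSketch(int maxUrlLength, int maxPathLength, int maxQueryLength) {
    this.maxUrlLength = maxUrlLength;
    this.maxPathLength = maxPathLength;
    this.maxQueryLength = maxQueryLength;
  }

  /** Returns the URL unchanged if it passes all checks, or null if it is rejected. */
  String filter(String urlString) {
    // Cheapest check first: the whole string, no parsing needed.
    if (maxUrlLength >= 0 && urlString.length() > maxUrlLength) {
      return null;
    }
    URI uri;
    try {
      uri = new URI(urlString);
    } catch (URISyntaxException e) {
      // This sketch rejects anything it cannot parse.
      return null;
    }
    // Raw (still percent-encoded) components, so lengths match the original string.
    String path = uri.getRawPath();
    if (maxPathLength >= 0 && path != null && path.length() > maxPathLength) {
      return null;
    }
    String query = uri.getRawQuery();
    if (maxQueryLength >= 0 && query != null && query.length() > maxQueryLength) {
      return null;
    }
    return urlString;
  }
}
```

      Each check is a constant-time length comparison after a single parse, which is the efficiency argument against expressing the same limits as regular expressions.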

            People

              Assignee: Unassigned
              Reporter: Julien Nioche (jnioche)
              Votes: 0
              Watchers: 4
