Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-2414

Allow LanguageIndexingFilter to actually filter documents by language.

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.13
    • 1.14
    • plugin
    • None

    Description

      It is often useful to only index pages in select languages (e.g. only those languages that we intend to search in). At first glance it seems that this is done by LanguageIndexingFilter, but currently all the filter does is add the language as a field to the index.
      We can add a configuration property to LanguageIndexingFilter that will allow it to only index languages specified in this property.

      Attachments

        Issue Links

          Activity

            People

              lewismc Lewis John McGibbney
              yossi Yossi Tamari
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: