  1. Solr
  2. SOLR-1536

Support for TokenFilters that may modify input documents

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: 1.5
    • Fix Version/s: None
    • Component/s: Schema and Analysis
    • Labels: None

    Description

      In some scenarios it is useful to create or modify fields in the input document based on the analysis of other fields in the same document. This need arises, for example, when indexing multilingual documents or when doing NLP processing such as named-entity recognition (NER). Currently, this is not possible.

      This issue provides an implementation of this functionality that consists of the following parts:

      • DocumentAlteringFilterFactory - abstract superclass indicating that TokenFilters created by this factory may modify fields in a SolrInputDocument.
      • TypeAsFieldFilterFactory - example implementation that illustrates this concept, with a JUnit test.
      • DocumentBuilder modifications to support this functionality.
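
      The idea behind the parts listed above can be illustrated with a minimal, self-contained sketch. It does not use the Solr or Lucene APIs and is not the code from the attached patch: TypedToken, InputDoc, DocAlteringFilterFactorySketch and TypeAsFieldFactorySketch are hypothetical stand-ins, and the behaviour shown (copying each token's text into a field named after the token's type) is only an assumption about what a "type as field" filter might do.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for an analyzed token: its text plus a type label such as "PERSON". */
final class TypedToken {
    final String text;
    final String type;

    TypedToken(String text, String type) {
        this.text = text;
        this.type = type;
    }
}

/** Hypothetical stand-in for SolrInputDocument: field name mapped to a list of values. */
final class InputDoc {
    private final Map<String, List<String>> fields = new LinkedHashMap<>();

    void addField(String name, String value) {
        fields.computeIfAbsent(name, k -> new ArrayList<>()).add(value);
    }

    List<String> getFieldValues(String name) {
        List<String> values = fields.get(name);
        return values == null ? Collections.<String>emptyList() : values;
    }
}

/**
 * Hypothetical counterpart of DocumentAlteringFilterFactory: unlike an ordinary
 * filter factory, create() also receives the document being indexed, so the
 * filter it returns may add or change fields while tokens are consumed.
 */
abstract class DocAlteringFilterFactorySketch {
    abstract Iterator<TypedToken> create(Iterator<TypedToken> input, InputDoc doc);
}

/** Assumed behaviour: copy each token's text into a field named prefix + token type. */
final class TypeAsFieldFactorySketch extends DocAlteringFilterFactorySketch {
    private final String fieldPrefix;

    TypeAsFieldFactorySketch(String fieldPrefix) {
        this.fieldPrefix = fieldPrefix;
    }

    @Override
    Iterator<TypedToken> create(final Iterator<TypedToken> input, final InputDoc doc) {
        return new Iterator<TypedToken>() {
            @Override
            public boolean hasNext() {
                return input.hasNext();
            }

            @Override
            public TypedToken next() {
                TypedToken token = input.next();
                // Side effect on the document: this is the "altering" part.
                doc.addField(fieldPrefix + token.type, token.text);
                return token; // the token itself passes through unchanged
            }
        };
    }
}
```

      In this sketch the document is only modified as the token stream is consumed, so the indexing code has to drain the filter before it reads the new fields. That ordering concern is presumably why the issue also lists DocumentBuilder modifications.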

      Attachments

        1. altering.patch
          26 kB
          Andrzej Bialecki
        2. altering.patch
          26 kB
          Andrzej Bialecki
        3. altering.patch
          26 kB
          Andrzej Bialecki

            People

              Assignee: Unassigned
              Reporter: Andrzej Bialecki
              Votes: 4
              Watchers: 5

              Dates

                Created:
                Updated:
                Resolved: