[UIMA-1299] Contribution of Lucene CAS Indexer

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.3S
    • Component/s: Sandbox-Lucas
    • Labels: None

    Description

      Lucas is a UIMA CAS consumer component that writes CAS data into a Lucene index. It is driven by an XML-based "mapping configuration file" in which the user specifies which UIMA annotations go into which Lucene field, and how each field is set up (e.g. indexed and/or stored). In addition, some basic functionality for (ontological) hypernym indexing is provided.
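
To make the mapping idea concrete, here is a minimal, language-neutral sketch in Python. It does not reproduce Lucas's actual XML schema or the UIMA/Lucene APIs: the `FieldSpec` class, the `route` helper, and the `example.types.*` annotation type names are all hypothetical, chosen only to show annotation types being routed to named fields with per-field indexed/stored settings.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class FieldSpec:
    """Hypothetical stand-in for one field entry of a mapping file."""
    name: str
    indexed: bool = True
    stored: bool = False


# Hypothetical mapping: annotation type name -> target field setup.
# The field names "documenttext" and "title" come from the description;
# the type names are invented for illustration.
MAPPING: Dict[str, FieldSpec] = {
    "example.types.Token": FieldSpec("documenttext", indexed=True, stored=False),
    "example.types.Title": FieldSpec("title", indexed=True, stored=True),
}


def route(annotations: Iterable[Tuple[str, str]],
          mapping: Dict[str, FieldSpec]) -> Dict[str, List[str]]:
    """Group covered text by target field, skipping unmapped annotation types."""
    fields: Dict[str, List[str]] = {}
    for type_name, covered_text in annotations:
        spec = mapping.get(type_name)
        if spec is None:
            continue  # annotation type is not mentioned in the mapping
        fields.setdefault(spec.name, []).append(covered_text)
    return fields
```

Calling `route` with a list of `(type name, covered text)` pairs returns a dictionary keyed by field name; annotations whose type has no entry in the mapping are dropped.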

      Additionally, Lucas can perform offset-based token stream alignment and merge UIMA annotations (via token position increment) into the same Lucene field (e.g. "documenttext" or "title").
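
The offset-based merging described above can be sketched as follows. This is an illustrative Python model, not Lucas's implementation and not the Lucene API: the `Token` class and `merge_by_offset` function are hypothetical. It assumes the usual Lucene convention that a position increment of 0 stacks a token on the same position as the previous one, and it treats "aligned" as "same start offset", which is a simplification.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Token:
    """Hypothetical token: covered text, character offsets, position increment."""
    text: str
    start: int
    end: int
    position_increment: int = 1


def merge_by_offset(*streams: Iterable[Token]) -> List[Token]:
    """Merge several token streams into one, ordered by character offset.

    A token that starts at the same offset as the token before it gets
    position increment 0 (stacked on the same position); every other
    token gets increment 1.
    """
    # sorted() is stable, so tokens with equal offsets keep stream order.
    ordered = sorted(
        (tok for stream in streams for tok in stream),
        key=lambda tok: (tok.start, tok.end),
    )
    merged: List[Token] = []
    previous_start = None
    for tok in ordered:
        increment = 0 if tok.start == previous_start else 1
        merged.append(Token(tok.text, tok.start, tok.end, increment))
        previous_start = tok.start
    return merged


# Example over the text "aspirin inhibits COX-1": plain word tokens and
# entity-label annotations end up in one stream for the same field.
words = [Token("aspirin", 0, 7), Token("inhibits", 8, 16), Token("COX-1", 17, 22)]
entities = [Token("DRUG", 0, 7), Token("PROTEIN", 17, 22)]
merged = merge_by_offset(words, entities)
```

In the merged stream, `DRUG` and `PROTEIN` carry increment 0, so each occupies the same position as the word it annotates; a phrase or proximity query over the field can then match either the word or its label at that position.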

      Attachments

        1. lucene-indexer.tar.gz
          57 kB
          Rico Landefeld
        2. pom.xml
          8 kB
          Rico Landefeld
        3. lucas.tar.gz
          80 kB
          Rico Landefeld

            People

              Assignee: Jörn Kottmann
              Reporter: Rico Landefeld