  1. UIMA
  2. UIMA-1299

Contribution of Lucene CAS Indexer


Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.3S
    • Component/s: Sandbox-Lucas
    • Labels: None

    Description

      Lucas is a UIMA CAS consumer component that writes CAS data into a Lucene index. It is driven by an XML-based "mapping configuration file" in which the user specifies which UIMA annotations go into which Lucene field and how each field is set up (e.g. indexed and/or stored). In addition, some basic functionality for (ontological) hypernym indexing is provided.
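
      The mapping idea can be illustrated with a minimal Java sketch. It does not reproduce the Lucas configuration format or API, which this issue does not show: the `FieldRule` class, the `route` method, and the annotation type names are hypothetical, and annotations are simplified to `{typeName, coveredText}` pairs.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class MappingSketch {

    /** Hypothetical rule: annotation type -> field name, plus how the field is set up. */
    public static final class FieldRule {
        public final String annotationType;
        public final String fieldName;
        public final boolean indexed; // carried only to show the per-field setup
        public final boolean stored;  // carried only to show the per-field setup

        public FieldRule(String annotationType, String fieldName, boolean indexed, boolean stored) {
            this.annotationType = annotationType;
            this.fieldName = fieldName;
            this.indexed = indexed;
            this.stored = stored;
        }
    }

    /** Groups annotation texts by target field; annotation types without a rule are skipped. */
    public static Map<String, List<String>> route(List<FieldRule> rules, List<String[]> annotations) {
        Map<String, List<String>> fields = new LinkedHashMap<>();
        for (String[] annotation : annotations) { // annotation = {typeName, coveredText}
            for (FieldRule rule : rules) {
                if (rule.annotationType.equals(annotation[0])) {
                    fields.computeIfAbsent(rule.fieldName, k -> new ArrayList<>()).add(annotation[1]);
                }
            }
        }
        return fields;
    }

    public static void main(String[] args) {
        // Assumed example rules; the type names are made up for illustration.
        List<FieldRule> rules = List.of(
            new FieldRule("example.Token", "documenttext", true, false),
            new FieldRule("example.Title", "title", true, true));
        List<String[]> annotations = List.of(
            new String[] {"example.Title", "Lucene CAS Indexer"},
            new String[] {"example.Token", "Lucas"},
            new String[] {"example.Sentence", "no rule, so skipped"});
        System.out.println(route(rules, annotations));
    }
}
```

      In a real consumer the `indexed` and `stored` flags would decide how each Lucene field is created; here they are only carried along to show where that setup would live.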

      Additionally, Lucas can perform offset-based token stream alignment and merge UIMA annotations (via token position increment) into the same Lucene field (e.g. "documenttext" or "title").
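
      One way to picture offset-based merging is the following sketch in plain Java, without Lucene on the classpath. It is not the Lucas implementation. It assumes that two tokens share a position exactly when they start at the same character offset, in which case the later one receives a position increment of 0, as is commonly done for stacked tokens in Lucene. The `Token` class and `merge` method are hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class AlignmentSketch {

    /** Hypothetical token: text, character offsets, and a position increment. */
    public static final class Token {
        public final String text;
        public final int begin;
        public final int end;
        public final int positionIncrement;

        public Token(String text, int begin, int end, int positionIncrement) {
            this.text = text;
            this.begin = begin;
            this.end = end;
            this.positionIncrement = positionIncrement;
        }

        @Override
        public String toString() {
            return text + "[" + begin + "," + end + "]+" + positionIncrement;
        }
    }

    /**
     * Merges two token streams by begin offset. A token that starts at the same
     * offset as the token before it is stacked (increment 0); any other token
     * advances the position by 1. Increments on the input tokens are ignored.
     */
    public static List<Token> merge(List<Token> first, List<Token> second) {
        List<Token> all = new ArrayList<>(first);
        all.addAll(second);
        // List.sort is stable, so on equal offsets tokens from `first` stay ahead.
        all.sort(Comparator.comparingInt((Token t) -> t.begin));

        List<Token> merged = new ArrayList<>();
        int previousBegin = -1; // no real token starts at a negative offset
        for (Token t : all) {
            int increment = (t.begin == previousBegin) ? 0 : 1;
            merged.add(new Token(t.text, t.begin, t.end, increment));
            previousBegin = t.begin;
        }
        return merged;
    }

    public static void main(String[] args) {
        // Assumed input: word tokens plus one entity annotation over "heart attack".
        List<Token> words = List.of(
            new Token("heart", 0, 5, 1),
            new Token("attack", 6, 12, 1));
        List<Token> entities = List.of(new Token("DISEASE", 0, 12, 1));
        System.out.println(merge(words, entities));
    }
}
```

      Stacking an annotation at the same position as the word it starts on lets a phrase query match either the surface word or the annotation label at that position.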

      Attachments

        1. lucas.tar.gz
          80 kB
          Rico Landefeld
        2. pom.xml
          8 kB
          Rico Landefeld
        3. lucene-indexer.tar.gz
          57 kB
          Rico Landefeld

            People

              Assignee: Jörn Kottmann (joern)
              Reporter: Rico Landefeld (rico.la)
              Votes: 0
              Watchers: 0

              Dates

                Created:
                Updated:
                Resolved: