Uploaded image for project: 'cTAKES'
  1. cTAKES
  2. CTAKES-67

LuceneDictionaryImpl and iv_maxHits

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.0-incubating
    • None
    • None

    Description

      The default value for LuceneDictionaryImpl's iv_maxHits is Integer.MAX_VALUE, and this class uses iv_maxHits in this call:

      topDoc = iv_searcher.search(query.parse(str.replace('-', ' ')), iv_maxHits);

      This search method is only intended for use with a max hits up to about 100 items:

      https://issues.apache.org/jira/browse/LUCENE-4368?focusedCommentId=13451578&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13451578

      If you really want all the hits, then it'll speed things up (by a lot, in my experience) to write a simple Collector subclass that just collects them all.

      Attachments

        Activity

          People

            Unassigned Unassigned
            steven.bethard Steven Bethard
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: