Solr / SOLR-10156

Add significantTerms Streaming Expression

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Resolved
    • Affects Version/s: None
    • Fix Version/s: 6.5
    • Component/s: None
    • Labels: None

    Description

      The significantTerms Streaming Expression emits a set of terms from a text field, restricted to terms whose document frequency falls within a given range, for a specific query. It scores each term from two counts: how many times the term appears in the result set and how many times it appears in the corpus. It then returns the top N terms ranked by this significance score.
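The filtering and ranking described above can be sketched in a few lines of Python. This is only an illustration, not the code in the attached patch: the function name, the input dictionaries, and in particular the scoring formula (a TF-IDF-style weighting) are assumptions chosen for the example. The defaults mirror the ones listed in the syntax.

```python
import math


def significant_terms(result_counts, corpus_doc_freqs, num_docs,
                      min_doc_freq=5, max_doc_freq=0.3,
                      min_term_length=4, limit=20):
    """Rank terms from a result set by a hypothetical significance score.

    result_counts:    term -> number of result-set documents containing it
    corpus_doc_freqs: term -> number of documents in the whole index containing it
    num_docs:         total number of documents in the index
    """
    # maxDocFreq is treated here as a fraction of the index (assumption).
    max_docs = max_doc_freq * num_docs
    scored = []
    for term, fg_count in result_counts.items():
        doc_freq = corpus_doc_freqs.get(term, 0)
        if len(term) < min_term_length or fg_count <= 0:
            continue
        if doc_freq < min_doc_freq or doc_freq > max_docs:
            continue
        # Assumed scoring: frequent in the result set, rare in the corpus.
        score = math.log(fg_count) * (
            math.log((num_docs + 1) / (doc_freq + 1)) + 1.0)
        scored.append((term, score))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:limit]
```

Terms that are too short, too rare, or too common are dropped before scoring, so the limit applies only to terms that pass every filter.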

      Syntax:

      significantTerms(collection, 
                       q="any query", 
                       field="some_text_field", 
                       minDocFreq="5",          // optional, default is 5 documents
                       maxDocFreq=".3",         // optional, default is no more than 30% of the index (.3)
                       minTermLength="4",       // optional, default is 4
                       limit="50")              // optional, default is 20
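As a rough usage sketch, the expression can be assembled as a string and passed in the expr parameter of a collection's /stream handler. The helper names, the host, and the collection and field names below are hypothetical. The code only builds the URL and does not send a request, and it does not escape double quotes inside parameter values.

```python
from urllib.parse import urlencode


def build_significant_terms_expr(collection, q, field, **optional):
    """Build a significantTerms expression string.

    `optional` may carry minDocFreq, maxDocFreq, minTermLength, or limit.
    Every value is emitted as a double-quoted string, as in the syntax above.
    """
    params = {"q": q, "field": field, **optional}
    args = ", ".join(f'{name}="{value}"' for name, value in params.items())
    return f"significantTerms({collection}, {args})"


def stream_url(base_url, collection, expr):
    """Return a URL for the collection's /stream handler with `expr` encoded."""
    return f"{base_url}/{collection}/stream?" + urlencode({"expr": expr})


# Hypothetical collection, field, and host names.
expr = build_significant_terms_expr(
    "collection1", "body:solr", "body", minDocFreq=5, limit=50)
url = stream_url("http://localhost:8983/solr", "collection1", expr)
```

The resulting URL can then be fetched with any HTTP client against a running Solr instance.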
      

      Attachments

        1. SOLR-10156.patch
          23 kB
          Joel Bernstein
        2. SOLR-10156.patch
          30 kB
          Joel Bernstein
        3. SOLR-10156.patch
          36 kB
          Joel Bernstein

            People

              Assignee: Joel Bernstein (jbernste)
              Reporter: Joel Bernstein (jbernste)
              Votes: 0
              Watchers: 3
