Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-10156

Add significantTerms Streaming Expression

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Resolved
    • Affects Version/s: None
    • Fix Version/s: 6.5
    • Component/s: None
    • Security Level: Public (Default Security Level. Issues are Public)
    • Labels:
      None

      Description

      The significantTerms Streaming Expression will emit a set of terms from a text field within a doc frequency range for a specific query. It will also score the terms based on how many times the terms appear in the result set, and how many times the terms appear in the corpus, and return the top N terms based on this significance score.

      Syntax:

      significantTerms(collection, 
                       q="any query", 
                       field="some_text_field", 
                       minDocFreq="5",   //optional default is 5 documents
                       maxDocFreq=".3", // optional default is no more then 30% of the index (.3)
                       minTermLength="4",  // optional default is 4
                       limit="50")                // optional default is 20
      

        Attachments

        1. SOLR-10156.patch
          36 kB
          Joel Bernstein
        2. SOLR-10156.patch
          30 kB
          Joel Bernstein
        3. SOLR-10156.patch
          23 kB
          Joel Bernstein

          Issue Links

            Activity

              People

              • Assignee:
                joel.bernstein Joel Bernstein
                Reporter:
                joel.bernstein Joel Bernstein
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: