Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-10156

Add significantTerms Streaming Expression

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Resolved
    • None
    • 6.5
    • None
    • None

    Description

      The significantTerms Streaming Expression will emit a set of terms from a text field within a doc frequency range for a specific query. It will also score the terms based on how many times the terms appear in the result set, and how many times the terms appear in the corpus, and return the top N terms based on this significance score.

      Syntax:

      significantTerms(collection, 
                       q="any query", 
                       field="some_text_field", 
                       minDocFreq="5",   //optional default is 5 documents
                       maxDocFreq=".3", // optional default is no more then 30% of the index (.3)
                       minTermLength="4",  // optional default is 4
                       limit="50")                // optional default is 20
      

      Attachments

        1. SOLR-10156.patch
          23 kB
          Joel Bernstein
        2. SOLR-10156.patch
          30 kB
          Joel Bernstein
        3. SOLR-10156.patch
          36 kB
          Joel Bernstein

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            jbernste Joel Bernstein
            jbernste Joel Bernstein
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment