Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 1.3
    • Fix Version/s: 3.3
    • Component/s: search
    • Labels:
      None

      Description

      This patch include a new feature called "Field collapsing".

      "Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
      http://www.fastsearch.com/glossary.aspx?m=48&amid=299

      The implementation add 3 new query parameters (SolrParams):
      "collapse.field" to choose the field used to group results
      "collapse.type" normal (default value) or adjacent
      "collapse.max" to select how many continuous results are allowed before collapsing

      TODO (in progress):

      • More documentation (on source code)
      • Test cases

      Two patches:

      • "field_collapsing.patch" for current development version
      • "field_collapsing_1.1.0.patch" for Solr-1.1.0

      P.S.: Feedback and misspelling correction are welcome

        Attachments

        1. SOLR-236-trunk.patch
          236 kB
          Martijn van Groningen
        2. SOLR-236-trunk.patch
          247 kB
          Martijn van Groningen
        3. SOLR-236-trunk.patch
          250 kB
          Martijn van Groningen
        4. SOLR-236-trunk.patch
          256 kB
          Martijn van Groningen
        5. SOLR-236-trunk.patch
          259 kB
          Martijn van Groningen
        6. SOLR-236-FieldCollapsing.patch
          16 kB
          Ryan McKinley
        7. SOLR-236-FieldCollapsing.patch
          18 kB
          Ryan McKinley
        8. SOLR-236-FieldCollapsing.patch
          18 kB
          Emmanuel Keller
        9. SOLR-236-distinctFacet.patch
          2 kB
          Bill Bell
        10. SOLR-236-branch_3x.patch
          258 kB
          Doug Steigerwald
        11. SOLR-236-1_4_1-paging-totals-working.patch
          264 kB
          Stephen Weiss
        12. SOLR-236-1_4_1-NPEfix.patch
          0.7 kB
          Cameron
        13. SOLR-236-1_4_1.patch
          264 kB
          Martijn van Groningen
        14. SOLR-236.patch
          253 kB
          Shalin Shekhar Mangar
        15. SOLR-236.patch
          245 kB
          Martijn van Groningen
        16. SOLR-236.patch
          257 kB
          Shalin Shekhar Mangar
        17. SOLR-236.patch
          251 kB
          Martijn van Groningen
        18. SOLR-236.patch
          252 kB
          Shalin Shekhar Mangar
        19. SOLR-236.patch
          244 kB
          Martijn van Groningen
        20. SOLR-236.patch
          245 kB
          Martijn van Groningen
        21. SOLR-236.patch
          27 kB
          Yonik Seeley
        22. solr-236.patch
          24 kB
          Bojan Smid
        23. SOLR-236_collapsing.patch
          26 kB
          Dmitry Lihachev
        24. SOLR-236_collapsing.patch
          25 kB
          Thomas Traeger
        25. quasidistributed.additional.patch
          1 kB
          Michael Gundlach
        26. NonAdjacentDocumentCollapserTest.java
          9 kB
          Peter Karich
        27. NonAdjacentDocumentCollapser.java
          21 kB
          Peter Karich
        28. field-collapsing-extended-592129.patch
          31 kB
          Karsten Sperling
        29. field-collapse-solr-236-2.patch
          52 kB
          Martijn van Groningen
        30. field-collapse-solr-236.patch
          49 kB
          Martijn van Groningen
        31. field-collapse-5.patch
          122 kB
          Martijn van Groningen
        32. field-collapse-5.patch
          133 kB
          Martijn van Groningen
        33. field-collapse-5.patch
          134 kB
          Martijn van Groningen
        34. field-collapse-5.patch
          134 kB
          Martijn van Groningen
        35. field-collapse-5.patch
          136 kB
          Martijn van Groningen
        36. field-collapse-5.patch
          146 kB
          Martijn van Groningen
        37. field-collapse-5.patch
          144 kB
          Martijn van Groningen
        38. field-collapse-5.patch
          216 kB
          Martijn van Groningen
        39. field-collapse-5.patch
          218 kB
          Martijn van Groningen
        40. field-collapse-5.patch
          218 kB
          Martijn van Groningen
        41. field-collapse-5.patch
          239 kB
          Martijn van Groningen
        42. field-collapse-5.patch
          244 kB
          Martijn van Groningen
        43. field-collapse-5.patch
          251 kB
          Martijn van Groningen
        44. field-collapse-5.patch
          253 kB
          Martijn van Groningen
        45. field-collapse-5.patch
          254 kB
          Martijn van Groningen
        46. field-collapse-4-with-solrj.patch
          66 kB
          Martijn van Groningen
        47. field-collapse-3.patch
          52 kB
          Martijn van Groningen
        48. field_collapsing_dsteigerwald.diff
          25 kB
          Doug Steigerwald
        49. field_collapsing_dsteigerwald.diff
          25 kB
          Charles Hornberger
        50. field_collapsing_dsteigerwald.diff
          25 kB
          Oleg Gnatovskiy
        51. field_collapsing_1.3.patch
          14 kB
          Emmanuel Keller
        52. field_collapsing_1.1.0.patch
          12 kB
          Emmanuel Keller
        53. DocSetScoreCollector.java
          5 kB
          Peter Karich
        54. collapsing-patch-to-1.3.0-ivan.patch
          24 kB
          Iván de Prado
        55. collapsing-patch-to-1.3.0-ivan_3.patch
          24 kB
          Iván de Prado
        56. collapsing-patch-to-1.3.0-ivan_2.patch
          24 kB
          Iván de Prado
        57. collapsing-patch-to-1.3.0-dieter.patch
          26 kB
          dieter grad

          Issue Links

          1.
          Provide an API to specify custom Collectors Sub-task Resolved Unassigned
          2.
          Fieldcollapse SolrJ code Sub-task Closed Unassigned
          3.
          Implement CollapseComponent Sub-task Closed Shalin Shekhar Mangar
          4.
          Distributed field collapsing Sub-task Closed Unassigned
          5.
          Refactor QueryComponent for easy extensibility Sub-task Resolved Shalin Shekhar Mangar
          6.
          Support fixing the number of shards in BaseDistributedTestCase Sub-task Resolved Shalin Shekhar Mangar
          7.
          Search Grouping: single doclist format Sub-task Resolved Unassigned
          8.
          Search Grouping: support highlighting Sub-task Closed Unassigned
          9.
          Search Grouping: support explain (debugQuery) Sub-task Resolved Unassigned
          10.
          Search Grouping: support distributed search Sub-task Closed Unassigned
          11.
          Search Grouping: CSV response writer Sub-task Open Unassigned
          12.
          Search Grouping: collapse by string specialization Sub-task Closed Unassigned
          13.
          Search Grouping: intermediate caches Sub-task Open Unassigned
          14.
          Search Grouping: single pass implementation Sub-task Open Unassigned
          15.
          Search Grouping: unlikely collision implementation Sub-task Open Unassigned
          16.
          Search Grouping: expand group sort options Sub-task Open Unassigned
          17.
          Search Grouping: SolrJ support Sub-task Resolved Unassigned
          18.
          Search Grouping: Facet support Sub-task Closed Unassigned
          19.
          Search Grouping: Group by query (like facet.query) Sub-task Resolved Unassigned
          20.
          Add grouping support to Velocity UI Sub-task Resolved Erik Hatcher
          21.
          Externalizing groupValue values Sub-task Closed Unassigned
          22.
          Grouping treats null values as equivalent to 0 or an empty string Sub-task Resolved Unassigned
          23.
          Grouping performance improvements Sub-task Closed Unassigned
          24.
          Search Grouping: random testing Sub-task Resolved Unassigned

            Activity

              People

              • Assignee:
                shalinmangar Shalin Shekhar Mangar
                Reporter:
                ekeller Emmanuel Keller
              • Votes:
                76 Vote for this issue
                Watchers:
                77 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: