Issue Details (XML | Word | Printable)

Key: SOLR-236
Type: New Feature New Feature
Status: Open Open
Priority: Major Major
Assignee: Unassigned
Reporter: Emmanuel Keller
Votes: 49
Watchers: 61
Operations

If you were logged in you would be able to see more operations.
Solr

Field collapsing

Created: 11/May/07 10:13 PM   Updated: Sunday 09:56 PM
Return to search "STDCXX SGI MIPSpro issues"
Component/s: search
Affects Version/s: 1.3
Fix Version/s: 1.5

Time Tracking:
Not Specified

File Attachments:
  Size
Text File Licensed for inclusion in ASF works collapsing-patch-to-1.3.0-dieter.patch 2009-01-29 02:40 PM dieter grad 26 kB
Text File Licensed for inclusion in ASF works collapsing-patch-to-1.3.0-ivan.patch 2008-11-13 04:33 PM Iván de Prado 24 kB
Text File Licensed for inclusion in ASF works collapsing-patch-to-1.3.0-ivan_2.patch 2008-12-10 04:31 PM Iván de Prado 24 kB
Text File Licensed for inclusion in ASF works collapsing-patch-to-1.3.0-ivan_3.patch 2008-12-17 12:43 PM Iván de Prado 24 kB
Text File Licensed for inclusion in ASF works field-collapse-3.patch 2009-07-25 12:58 PM Martijn van Groningen 52 kB
Text File Licensed for inclusion in ASF works field-collapse-4-with-solrj.patch 2009-08-10 08:20 PM Martijn van Groningen 66 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-11-29 09:55 PM Martijn van Groningen 251 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-11-22 10:00 PM Martijn van Groningen 244 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-11-15 08:55 PM Martijn van Groningen 239 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-11-11 06:07 AM Martijn van Groningen 218 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-10-27 04:28 PM Martijn van Groningen 218 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-10-25 10:13 PM Martijn van Groningen 216 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-10-14 09:22 PM Martijn van Groningen 144 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-09-26 03:32 PM Martijn van Groningen 146 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-09-14 08:15 PM Martijn van Groningen 136 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-09-12 03:31 PM Martijn van Groningen 134 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-09-12 11:22 AM Martijn van Groningen 134 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-09-10 11:24 PM Martijn van Groningen 133 kB
Text File Licensed for inclusion in ASF works field-collapse-5.patch 2009-08-24 08:59 PM Martijn van Groningen 122 kB
Text File Licensed for inclusion in ASF works field-collapse-solr-236-2.patch 2009-05-30 11:26 AM Martijn van Groningen 52 kB
Text File Licensed for inclusion in ASF works field-collapse-solr-236.patch 2009-05-29 12:52 PM Martijn van Groningen 49 kB
Text File Licensed for inclusion in ASF works field-collapsing-extended-592129.patch 2007-11-06 10:05 PM Karsten Sperling 31 kB
Text File Licensed for inclusion in ASF works field_collapsing_1.1.0.patch 2007-05-19 02:24 PM Emmanuel Keller 12 kB
Text File Licensed for inclusion in ASF works field_collapsing_1.3.patch 2007-10-28 09:20 PM Emmanuel Keller 14 kB
File Licensed for inclusion in ASF works field_collapsing_dsteigerwald.diff 2008-02-14 11:38 PM Oleg Gnatovskiy 25 kB
File Licensed for inclusion in ASF works field_collapsing_dsteigerwald.diff 2008-01-10 01:17 AM Charles Hornberger 25 kB
File Licensed for inclusion in ASF works field_collapsing_dsteigerwald.diff 2008-01-04 07:40 PM Doug Steigerwald 25 kB
Text File Licensed for inclusion in ASF works quasidistributed.additional.patch 2009-11-10 04:12 PM Michael Gundlach 1 kB
Text File Licensed for inclusion in ASF works SOLR-236-FieldCollapsing.patch 2007-06-27 03:40 PM Emmanuel Keller 18 kB
Text File Licensed for inclusion in ASF works SOLR-236-FieldCollapsing.patch 2007-06-15 06:31 PM Ryan McKinley 18 kB
Text File Licensed for inclusion in ASF works SOLR-236-FieldCollapsing.patch 2007-06-04 02:47 AM Ryan McKinley 16 kB
Text File Licensed for inclusion in ASF works solr-236.patch 2008-06-07 12:36 PM Bojan Smid 24 kB
Text File Licensed for inclusion in ASF works SOLR-236_collapsing.patch 2009-05-06 10:48 PM Thomas Traeger 25 kB
Text File Licensed for inclusion in ASF works SOLR-236_collapsing.patch 2009-03-25 08:27 AM Dmitry Lihachev 26 kB
Issue Links:
Dependants
 
Reference
 


 Description  « Hide
This patch include a new feature called "Field collapsing".

"Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
http://www.fastsearch.com/glossary.aspx?m=48&amid=299

The implementation add 3 new query parameters (SolrParams):
"collapse.field" to choose the field used to group results
"collapse.type" normal (default value) or adjacent
"collapse.max" to select how many continuous results are allowed before collapsing

TODO (in progress):

  • More documentation (on source code)
  • Test cases

Two patches:

  • "field_collapsing.patch" for current development version
  • "field_collapsing_1.1.0.patch" for Solr-1.1.0

P.S.: Feedback and misspelling correction are welcome



 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Emmanuel Keller made changes - 11/May/07 10:14 PM
Field Original Value New Value
Attachment collapse_field.patch [ 12357145 ]
Emmanuel Keller made changes - 11/May/07 10:48 PM
Attachment collapse_field.patch [ 12357149 ]
Emmanuel Keller made changes - 12/May/07 02:42 PM
Link This issue is related to SOLR-237 [ SOLR-237 ]
Emmanuel Keller made changes - 13/May/07 09:09 PM
Attachment field_collapsing.patch [ 12357195 ]
Emmanuel Keller made changes - 14/May/07 09:27 AM
Attachment field_collapsing.patch [ 12357238 ]
Emmanuel Keller made changes - 14/May/07 05:04 PM
Description This patch include a new feature called "Field collapsing".

"Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
http://www.fastsearch.com/glossary.aspx?m=48&amid=299

The implementation add 3 new query parameters (SolrParams):
"collapse" set to true to enable collapsing.
"collapse.field" to choose the field used to group results
"collapse.max" to select how many continuous results are allowed before collapsing

TODO (in progress):
- More documentation (on source code)
- Test cases
This patch include a new feature called "Field collapsing".

"Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
http://www.fastsearch.com/glossary.aspx?m=48&amid=299

The implementation add 4 new query parameters (SolrParams):
"collapse" set to true to enable collapsing.
"collapse.field" to choose the field used to group results
"collapse.type" normal (default value) or adjacent
"collapse.max" to select how many continuous results are allowed before collapsing

TODO (in progress):
- More documentation (on source code)
- Test cases
Emmanuel Keller made changes - 19/May/07 02:19 PM
Attachment field_collapsing.patch [ 12357680 ]
Emmanuel Keller made changes - 19/May/07 02:24 PM
Attachment field_collapsing_1.1.0.patch [ 12357681 ]
Emmanuel Keller made changes - 19/May/07 02:31 PM
Description This patch include a new feature called "Field collapsing".

"Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
http://www.fastsearch.com/glossary.aspx?m=48&amid=299

The implementation add 4 new query parameters (SolrParams):
"collapse" set to true to enable collapsing.
"collapse.field" to choose the field used to group results
"collapse.type" normal (default value) or adjacent
"collapse.max" to select how many continuous results are allowed before collapsing

TODO (in progress):
- More documentation (on source code)
- Test cases
This patch include a new feature called "Field collapsing".

"Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
http://www.fastsearch.com/glossary.aspx?m=48&amid=299

The implementation add 3 new query parameters (SolrParams):
"collapse.field" to choose the field used to group results
"collapse.type" normal (default value) or adjacent
"collapse.max" to select how many continuous results are allowed before collapsing

TODO (in progress):
- More documentation (on source code)
- Test cases

Two patches:
- "field_collapsing.patch" for current development version (1.2)
- "field_collapsing_1.1.0.patch" for Solr-1.1.0


P.S.: Feedback and misspelling correction are welcome ;-)
Ryan McKinley made changes - 04/Jun/07 02:47 AM
Attachment SOLR-236-FieldCollapsing.patch [ 12358794 ]
Emmanuel Keller made changes - 04/Jun/07 10:09 PM
Attachment SOLR-236-FieldCollapsing.patch [ 12358906 ]
Emmanuel Keller made changes - 05/Jun/07 10:33 AM
Attachment SOLR-236-FieldCollapsing.patch [ 12358942 ]
Emmanuel Keller made changes - 05/Jun/07 10:33 AM
Attachment collapse_field.patch [ 12357149 ]
Emmanuel Keller made changes - 05/Jun/07 10:33 AM
Attachment collapse_field.patch [ 12357145 ]
Emmanuel Keller made changes - 05/Jun/07 10:33 AM
Attachment field_collapsing.patch [ 12357680 ]
Emmanuel Keller made changes - 05/Jun/07 10:33 AM
Attachment field_collapsing.patch [ 12357238 ]
Emmanuel Keller made changes - 05/Jun/07 10:34 AM
Attachment field_collapsing.patch [ 12357195 ]
Emmanuel Keller made changes - 05/Jun/07 10:34 AM
Attachment SOLR-236-FieldCollapsing.patch [ 12358906 ]
Emmanuel Keller made changes - 11/Jun/07 08:39 AM
Attachment SOLR-236-FieldCollapsing.patch [ 12359403 ]
Emmanuel Keller made changes - 11/Jun/07 08:39 AM
Attachment SOLR-236-FieldCollapsing.patch [ 12358942 ]
Emmanuel Keller made changes - 11/Jun/07 08:50 AM
Description This patch include a new feature called "Field collapsing".

"Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
http://www.fastsearch.com/glossary.aspx?m=48&amid=299

The implementation add 3 new query parameters (SolrParams):
"collapse.field" to choose the field used to group results
"collapse.type" normal (default value) or adjacent
"collapse.max" to select how many continuous results are allowed before collapsing

TODO (in progress):
- More documentation (on source code)
- Test cases

Two patches:
- "field_collapsing.patch" for current development version (1.2)
- "field_collapsing_1.1.0.patch" for Solr-1.1.0


P.S.: Feedback and misspelling correction are welcome ;-)
This patch include a new feature called "Field collapsing".

"Used in order to collapse a group of results with similar value for a given field to a single entry in the result set. Site collapsing is a special case of this, where all results for a given web site is collapsed into one or two entries in the result set, typically with an associated "more documents from this site" link. See also Duplicate detection."
http://www.fastsearch.com/glossary.aspx?m=48&amid=299

The implementation add 3 new query parameters (SolrParams):
"collapse.field" to choose the field used to group results
"collapse.type" normal (default value) or adjacent
"collapse.max" to select how many continuous results are allowed before collapsing

TODO (in progress):
- More documentation (on source code)
- Test cases

Two patches:
- "field_collapsing.patch" for current development version
- "field_collapsing_1.1.0.patch" for Solr-1.1.0


P.S.: Feedback and misspelling correction are welcome ;-)
Affects Version/s 1.2 [ 12312235 ]
Affects Version/s 1.3 [ 12312486 ]
Ryan McKinley made changes - 15/Jun/07 06:31 PM
Attachment SOLR-236-FieldCollapsing.patch [ 12359902 ]
Emmanuel Keller made changes - 18/Jun/07 05:37 PM
Attachment SOLR-236-FieldCollapsing.patch [ 12359403 ]
Emmanuel Keller made changes - 27/Jun/07 03:40 PM
Attachment SOLR-236-FieldCollapsing.patch [ 12360676 ]
Emmanuel Keller made changes - 28/Oct/07 08:54 PM
Attachment field_collapsing_1.3.patch [ 12368569 ]
Emmanuel Keller made changes - 28/Oct/07 09:20 PM
Attachment field_collapsing_1.3.patch [ 12368570 ]
Emmanuel Keller made changes - 28/Oct/07 09:20 PM
Attachment field_collapsing_1.3.patch [ 12368569 ]
Karsten Sperling made changes - 06/Nov/07 10:06 PM
Attachment field-collapsing-extended-592129.patch [ 12369053 ]
Ryan McKinley made changes - 09/Nov/07 01:25 AM
Link This issue depends on SOLR-281 [ SOLR-281 ]
Doug Steigerwald made changes - 04/Jan/08 07:40 PM
Attachment field_collapsing_dsteigerwald.diff [ 12372522 ]
Charles Hornberger made changes - 10/Jan/08 01:17 AM
Attachment field_collapsing_dsteigerwald.diff [ 12372861 ]
Oleg Gnatovskiy made changes - 14/Feb/08 11:38 PM
Attachment field_collapsing_dsteigerwald.diff [ 12375643 ]
Otis Gospodnetic made changes - 22/May/08 04:26 PM
Assignee Otis Gospodnetic [ otis ]
Otis Gospodnetic made changes - 25/May/08 02:04 PM
Comment [ Hi / Hallo,

Thanks for your mail. Unfortunately, I won't be able to answer it
soon. I'm on vacation till June 2nd without access to my mails.

~~~~

Vielen Dank für die Email. Leider werde ich nicht sofort antworten.
Ich bin bis 2. Juni im Urlaub ohne Zugriff auf mein Postfach.

-Gunnar

--
Gunnar Wagenknecht
gunnar@wagenknecht.org
http://wagenknecht.org/
]
Bojan Smid made changes - 07/Jun/08 12:36 PM
Attachment solr-236.patch [ 12383622 ]
Muddassir hasan made changes - 31/Jul/08 10:01 AM
Comment [ I tried to use this patch but i could make it work for me. I compiled solr with patch applied and added following to my solrconf : <searchComponent name="collapse" class="org.apache.solr.handler.component.CollapseComponent" /> and
<requestHandler name="/search" class="org.apache.solr.handler.component.SearchHandler">
    <arr name="components">
      <str>collapse</str>
    </arr>
</requestHandler>

It started perfectly but i could not find any collapse on using
collapse.field=key_string11&collapse.type=normal&collapse.max=1&collapse=true

If i m missing out something. plz let me know my mistake. ]
Shalin Shekhar Mangar made changes - 19/Aug/08 08:24 PM
Fix Version/s 1.4 [ 12313351 ]
Otis Gospodnetic made changes - 09/Nov/08 09:11 PM
Link This issue is related to LUCENE-1421 [ LUCENE-1421 ]
Otis Gospodnetic made changes - 13/Nov/08 04:45 AM
Assignee Otis Gospodnetic [ otis ]
Iván de Prado made changes - 13/Nov/08 04:33 PM
Attachment collapsing-patch-to-1.3.0-ivan.patch [ 12393877 ]
Iván de Prado made changes - 10/Dec/08 04:31 PM
Attachment collapsing-patch-to-1.3.0-ivan_2.patch [ 12395742 ]
Iván de Prado made changes - 17/Dec/08 12:43 PM
Attachment collapsing-patch-to-1.3.0-ivan_3.patch [ 12396300 ]
dieter grad made changes - 29/Jan/09 02:40 PM
Attachment collapsing-patch-to-1.3.0-dieter.patch [ 12399009 ]
Shalin Shekhar Mangar made changes - 17/Feb/09 07:29 AM
Fix Version/s 1.5 [ 12313566 ]
Fix Version/s 1.4 [ 12313351 ]
Dmitry Lihachev made changes - 25/Mar/09 08:26 AM
Attachment SOLR-236_collapsing.patch [ 12403589 ]
Dmitry Lihachev made changes - 25/Mar/09 08:27 AM
Attachment SOLR-236_collapsing.patch [ 12403589 ]
Dmitry Lihachev made changes - 25/Mar/09 08:27 AM
Attachment SOLR-236_collapsing.patch [ 12403590 ]
Domingo Gómez García made changes - 05/May/09 09:26 AM
Comment [ The results of collapse_counts are not what i have expected. It losses many categories, only showing a few . I tried incrementing the collapse.max parameter:

max=1 results

<lst name="doc">
<int name="2008/LICOBLE-00023">109</int>
<int name="2008/LICOBLE-3">5</int>
<int name="2009/LICOBLE-00036">4</int>
<int name="2009/LICOBLE-00095">1</int>
</lst>

<lst name="count">
<int name="12740">109</int>
<int name="12741">5</int>
<int name="13282">4</int>
<int>1</int>
</lst>


max=2 results

<lst name="doc">
<int name="2009/LICOBLE-00008">108</int>
<int name="2007/LICOBLE-1">4</int>
</lst>

<lst name="count">
<int name="12740">108</int>
<int name="12741">4</int>
</lst>


max=3 results

<lst name="doc">
<int name="2008/LICOBLE-00020">107</int>
<int name="2008/LICOBLE-00021">3</int>
</lst>

<lst name="count">
<int name="12740">107</int>
<int name="12741">3</int>
</lst>


max=4

<lst name="doc">
<int name="2009/LICOBLE-00060">106</int>
</lst>

<lst name="count">
<int name="12740">106</int>
</lst>

How is possible to get less results each time? There are like 70 categories, do I have any way to obtain all those counts? Am I mising any collapsing concept?
Thanks. ]
Thomas Traeger made changes - 06/May/09 10:48 PM
Attachment SOLR-236_collapsing.patch [ 12407410 ]
Martijn van Groningen made changes - 29/May/09 12:52 PM
Attachment field-collapse-solr-236.patch [ 12409377 ]
Martijn van Groningen made changes - 30/May/09 11:26 AM
Attachment field-collapse-solr-236-2.patch [ 12409459 ]
Martijn van Groningen made changes - 25/Jul/09 12:58 PM
Attachment field-collapse-3.patch [ 12414513 ]
Hoss Man made changes - 08/Aug/09 06:12 AM
Link This issue is related to SOLR-1311 [ SOLR-1311 ]
Martijn van Groningen made changes - 10/Aug/09 08:20 PM
Attachment field-collapse-4-with-solrj.patch [ 12416108 ]
Martijn van Groningen made changes - 24/Aug/09 08:59 PM
Attachment field-collapse-5.patch [ 12417524 ]
Martijn van Groningen made changes - 10/Sep/09 11:24 PM
Attachment field-collapse-5.patch [ 12419237 ]
Martijn van Groningen made changes - 12/Sep/09 11:22 AM
Attachment field-collapse-5.patch [ 12419374 ]
Martijn van Groningen made changes - 12/Sep/09 03:31 PM
Attachment field-collapse-5.patch [ 12419386 ]
Martijn van Groningen made changes - 14/Sep/09 08:15 PM
Attachment field-collapse-5.patch [ 12419560 ]
Martijn van Groningen made changes - 26/Sep/09 03:32 PM
Attachment field-collapse-5.patch [ 12420627 ]
Martijn van Groningen made changes - 14/Oct/09 09:22 PM
Attachment field-collapse-5.patch [ 12422142 ]
Martijn van Groningen made changes - 25/Oct/09 10:13 PM
Attachment field-collapse-5.patch [ 12423156 ]
Martijn van Groningen made changes - 27/Oct/09 04:28 PM
Attachment field-collapse-5.patch [ 12423331 ]
Michael Gundlach made changes - 10/Nov/09 04:12 PM
Attachment quasidistributed.additional.patch [ 12424470 ]
Martijn van Groningen made changes - 11/Nov/09 06:07 AM
Attachment field-collapse-5.patch [ 12424567 ]
Martijn van Groningen made changes - 15/Nov/09 08:55 PM
Attachment field-collapse-5.patch [ 12425030 ]
Martijn van Groningen made changes - 22/Nov/09 10:00 PM
Attachment field-collapse-5.patch [ 12425775 ]
Martijn van Groningen made changes - 29/Nov/09 09:55 PM
Attachment field-collapse-5.patch [ 12426370 ]