[LUCENE-5527] Make the Collector API work per-segment - ASF JIRA

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 5.0, 6.0
Component/s: None
Labels: None
Lucene Fields: New

Description

Spin-off of ~~LUCENE-5299~~.

LUCENE-5299 proposes several changes, some of them controversial, but there is one that I really like: refactoring the Collector API so that each segment gets its own Collector.

The idea is, instead of having a single Collector object that needs to be able to take care of all segments, to have a top-level Collector:

public interface Collector {

  AtomicCollector setNextReader(AtomicReaderContext context) throws IOException;
  
}

and a per-AtomicReaderContext collector:

public interface AtomicCollector {

  void setScorer(Scorer scorer) throws IOException;

  void collect(int doc) throws IOException;

  boolean acceptsDocsOutOfOrder();

}

I think it makes the API clearer, since it is now obvious that setScorer and acceptsDocsOutOfOrder need to be called after setNextReader, which is otherwise unclear.

It also makes things more flexible. For example, a collector could much more easily decide to use different strategies on different segments. In particular, it makes the early-termination collector much cleaner, since it can return different atomic collector implementations depending on whether the current segment is sorted or not.
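As a rough illustration of the per-segment strategy idea, here is a minimal, self-contained sketch. It does not use the Lucene classes: SegmentInfo is a hypothetical stand-in for AtomicReaderContext, the interfaces are simplified (setScorer, acceptsDocsOutOfOrder and IOException are left out), and EarlyTerminatingCollector and its limit parameter are invented for the example. A real early-termination collector would stop iterating over a sorted segment rather than just ignore the remaining documents.

```java
import java.util.ArrayList;
import java.util.List;

class PerSegmentDemo {
    // Hypothetical stand-in for AtomicReaderContext: only what this sketch needs.
    static final class SegmentInfo {
        final int docBase;
        final boolean sorted;
        SegmentInfo(int docBase, boolean sorted) {
            this.docBase = docBase;
            this.sorted = sorted;
        }
    }

    // Simplified per-segment collector (setScorer/acceptsDocsOutOfOrder omitted).
    interface AtomicCollector {
        void collect(int doc);
    }

    // Simplified top-level collector: called once per segment.
    interface Collector {
        AtomicCollector setNextReader(SegmentInfo segment);
    }

    // Hypothetical collector: keeps at most `limit` hits from each sorted
    // segment and every hit from unsorted segments.
    static final class EarlyTerminatingCollector implements Collector {
        final List<Integer> hits = new ArrayList<>();
        private final int limit;

        EarlyTerminatingCollector(int limit) {
            this.limit = limit;
        }

        @Override
        public AtomicCollector setNextReader(SegmentInfo segment) {
            if (!segment.sorted) {
                // Unsorted segment: every document has to be considered.
                return doc -> hits.add(segment.docBase + doc);
            }
            // Sorted segment: the first `limit` documents are enough,
            // so later documents are ignored.
            return new AtomicCollector() {
                private int seen = 0;

                @Override
                public void collect(int doc) {
                    if (seen++ < limit) {
                        hits.add(segment.docBase + doc);
                    }
                }
            };
        }
    }

    // Feeds four documents from one sorted and one unsorted segment.
    static List<Integer> run() {
        EarlyTerminatingCollector collector = new EarlyTerminatingCollector(2);
        SegmentInfo[] segments = {
            new SegmentInfo(0, true), new SegmentInfo(100, false)
        };
        for (SegmentInfo segment : segments) {
            AtomicCollector perSegment = collector.setNextReader(segment);
            for (int doc = 0; doc < 4; doc++) {
                perSegment.collect(doc);
            }
        }
        return collector.hits;
    }

    public static void main(String[] args) {
        System.out.println(run()); // prints [0, 1, 100, 101, 102, 103]
    }
}
```

The choice between the two strategies is made once, in setNextReader, so neither per-segment collector needs a branch on the segment type inside collect.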

Even though we have lots of collectors all over the place, we could make migration easier by adding a base class that implements both Collector and AtomicCollector and returns this from setNextReader. Current concrete Collector implementations would then extend this class instead of implementing Collector directly.
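The migration path could look roughly like the sketch below. Again this is a self-contained illustration, not the patch itself: SegmentInfo stands in for AtomicReaderContext, the interfaces are simplified, and the names SingleObjectCollector, doSetNextReader and DocIdCollector are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class BridgeDemo {
    // Hypothetical stand-in for AtomicReaderContext.
    static final class SegmentInfo {
        final int docBase;
        SegmentInfo(int docBase) {
            this.docBase = docBase;
        }
    }

    // Simplified per-segment collector (setScorer/acceptsDocsOutOfOrder omitted).
    interface AtomicCollector {
        void collect(int doc);
    }

    // Simplified top-level collector.
    interface Collector {
        AtomicCollector setNextReader(SegmentInfo segment);
    }

    // Hypothetical bridge class: one object plays both roles, so an existing
    // collector keeps its single-object design under the new API.
    static abstract class SingleObjectCollector implements Collector, AtomicCollector {
        @Override
        public final AtomicCollector setNextReader(SegmentInfo segment) {
            doSetNextReader(segment); // old-style per-segment hook
            return this;              // the same object collects every segment
        }

        protected abstract void doSetNextReader(SegmentInfo segment);
    }

    // An old-style collector migrated by changing only its superclass.
    static final class DocIdCollector extends SingleObjectCollector {
        final List<Integer> globalDocIds = new ArrayList<>();
        private int docBase;

        @Override
        protected void doSetNextReader(SegmentInfo segment) {
            docBase = segment.docBase;
        }

        @Override
        public void collect(int doc) {
            globalDocIds.add(docBase + doc);
        }
    }

    // Collects two documents from each of two segments.
    static List<Integer> run() {
        DocIdCollector collector = new DocIdCollector();
        int[] docBases = {0, 10};
        for (int docBase : docBases) {
            AtomicCollector perSegment = collector.setNextReader(new SegmentInfo(docBase));
            perSegment.collect(0);
            perSegment.collect(1);
        }
        return collector.globalDocIds;
    }

    public static void main(String[] args) {
        System.out.println(run()); // prints [0, 1, 10, 11]
    }
}
```

With a bridge like this, callers only ever see the two-level API, while legacy collectors keep their mutable per-segment state in a single object.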

Attachments

LUCENE-5527.patch, 03/Apr/14 14:45, 201 kB, Adrien Grand
LUCENE-5527.patch, 03/Apr/14 23:22, 205 kB, Adrien Grand

People

Assignee: Adrien Grand

Reporter: Adrien Grand

Votes: 0

Watchers: 10

Dates

Created: 14/Mar/14 16:33

Updated: 28/Aug/22 14:02

Resolved: 04/Apr/14 16:17
