Details
-
New Feature
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
-
New
Description
I've been reviewing the ideas for updatable fields and have an alternative
proposal that I think would address my biggest concern:
- not slowing down searching
When I look at what Solr and Elasticsearch do here, by basically reindexing from stored fields, I think they solve a lot of the problem: users don't have to "rebuild" their document from scratch just to update one tiny piece.
But I think we can do this more efficiently: by avoiding reindexing of the unaffected fields.
The basic idea is that we would require term vectors for this approach (as the already store a serialized indexed version of the doc), and so we could just take the other pieces from the existing vectors for the doc.
I dont think we should discard the idea because vectors are slow/big today, this seems like something we could fix.
Personally I like the idea of not slowing down search performance to solve the problem, I think we should really start from that angle and work towards making the indexing side more efficient, not vice-versa.
Attachments
Issue Links
- depends upon
-
LUCENE-1888 Provide Option to Store Payloads on the Term Vector
- Closed
- is related to
-
LUCENE-4258 Incremental Field Updates through Stacked Segments
- Reopened
- relates to
-
LUCENE-4599 Compressed term vectors
- Closed