Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4272

another idea for updatable fields

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None
    • New

    Description

      I've been reviewing the ideas for updatable fields and have an alternative
      proposal that I think would address my biggest concern:

      • not slowing down searching

      When I look at what Solr and Elasticsearch do here, by basically reindexing from stored fields, I think they solve a lot of the problem: users don't have to "rebuild" their document from scratch just to update one tiny piece.

      But I think we can do this more efficiently: by avoiding reindexing of the unaffected fields.

      The basic idea is that we would require term vectors for this approach (as the already store a serialized indexed version of the doc), and so we could just take the other pieces from the existing vectors for the doc.

      I dont think we should discard the idea because vectors are slow/big today, this seems like something we could fix.

      Personally I like the idea of not slowing down search performance to solve the problem, I think we should really start from that angle and work towards making the indexing side more efficient, not vice-versa.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              rcmuir Robert Muir
              Votes:
              4 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated: