Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-7122

Implement script to compare lucene indexes logically

    XMLWordPrintableJSON

    Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Done
    • Affects Version/s: None
    • Fix Version/s: 1.8.0
    • Component/s: run
    • Labels:
      None

      Description

      With Document Traversal based indexing we have implemented a newer indexing logic. To validate that index produced by it is is same as one done by existing indexing flow we need to implement a script which can enable comparing the index content logically

      This was recently discussed on lucene mailing list [1] and suggestion there was it can be done by un-inverting the index. So to enable that we need to implement a script which can

      1. Open a Lucene index
      2. Map the Lucene Document to path of node
      3. For each document determine what all fields are associated with it (stored and non stored)
      4. Dump this content in file sorted by path and for each line field name sorted by name

      Then such dumps can be generated for old and new index and compared via simple text diff

      [1] http://lucene.markmail.org/thread/wt22gk6aufs4uz55

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                chetanm Chetan Mehrotra
                Reporter:
                chetanm Chetan Mehrotra
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: