Uploaded image for project: 'Jackrabbit Oak'
  1. Jackrabbit Oak
  2. OAK-7122

Implement script to compare lucene indexes logically

    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Major
    • Resolution: Done
    • None
    • 1.8.0
    • run
    • None

    Description

      With Document Traversal based indexing we have implemented a newer indexing logic. To validate that index produced by it is is same as one done by existing indexing flow we need to implement a script which can enable comparing the index content logically

      This was recently discussed on lucene mailing list [1] and suggestion there was it can be done by un-inverting the index. So to enable that we need to implement a script which can

      1. Open a Lucene index
      2. Map the Lucene Document to path of node
      3. For each document determine what all fields are associated with it (stored and non stored)
      4. Dump this content in file sorted by path and for each line field name sorted by name

      Then such dumps can be generated for old and new index and compared via simple text diff

      [1] http://lucene.markmail.org/thread/wt22gk6aufs4uz55

      Attachments

        Issue Links

          Activity

            People

              chetanm Chetan Mehrotra
              chetanm Chetan Mehrotra
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: