  1. Lucene - Core
  2. LUCENE-8452

BKD-based shape indexing benchmarks

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: modules/sandbox
    • Labels: None
    • Lucene Fields: New

    Description

      Initial benchmarking of the new BKD-based shape indexing suggests that searches can underperform. I am opening this ticket to share the findings and to start a discussion on how to speed up the solution.

       

      The first benchmark uses the existing benchmark in luceneutil for indexing points and searching by bounding box. We would expect LatLonShape to be slower than LatLonPoint but still to perform well. The results of running this benchmark on my computer are:

       

      LatLonPoint:

      89.717239531 sec to index

      INDEX SIZE: 0.5087761553004384 GB

      READER MB: 0.6098232269287109

      maxDoc=60844404

      totHits=221118844

      BEST M hits/sec: 72.91056132596746

      BEST QPS: 74.19031323419311 

       

      LatLonShape:

      89.388678805 sec to index

      INDEX SIZE: 1.3028179928660393 GB

      READER MB: 0.8827085494995117

      maxDoc=60844404

      totHits=221118844

      BEST M hits/sec: 1.0053836784184809

      BEST QPS: 1.0230305276205143
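
      To make the gap between the two runs above easier to read, the following is a minimal sketch that derives a few ratios from the reported figures. The dictionary keys and the `ratio` helper are illustrative names, not part of luceneutil; only the numbers come from the output above.

```python
# Figures copied from the LatLonPoint and LatLonShape runs reported above.
# The key names are hypothetical labels chosen for this sketch.
point = {
    "index_sec": 89.717239531,
    "index_gb": 0.5087761553004384,
    "qps": 74.19031323419311,
}
shape = {
    "index_sec": 89.388678805,
    "index_gb": 1.3028179928660393,
    "qps": 1.0230305276205143,
}


def ratio(a, b):
    """Return a / b, read as 'how many times larger a is than b'."""
    return a / b


# Query throughput: how many times faster LatLonPoint is than LatLonShape.
qps_slowdown = ratio(point["qps"], shape["qps"])
# Index size: how many times larger the LatLonShape index is.
size_growth = ratio(shape["index_gb"], point["index_gb"])
# Indexing time: LatLonShape relative to LatLonPoint.
index_time_ratio = ratio(shape["index_sec"], point["index_sec"])

print(f"QPS ratio (LatLonPoint / LatLonShape): {qps_slowdown:.1f}x")
print(f"Index size ratio (LatLonShape / LatLonPoint): {size_growth:.2f}x")
print(f"Indexing time ratio (LatLonShape / LatLonPoint): {index_time_ratio:.3f}")
```

      On these figures, indexing time is essentially the same for both, the LatLonShape index is roughly two and a half times larger, and the search throughput differs by well over an order of magnitude.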

       

      A second benchmark was performed by indexing around 10 million 4-sided polygons and around 3 million points. Searches are performed using bounding boxes. The results are compared with the spatial-tree alternatives. The spatial trees use a composite strategy with precision=0.001 degrees and distErrPct=0.25:

       

      s2 (Geo3d):

      1191.732124301 sec to index part 0

      INDEX SIZE: 3.2086284114047885 GB

      READER MB: 19.453557014465332

      maxDoc=12949519

      totHits=705758537

      BEST M hits/sec: 13.311369588840462

      BEST QPS: 4.243743434150063

       

      quad (JTS):

      3252.62925159 sec to index part 0

      INDEX SIZE: 4.5238002222031355 GB

      READER MB: 41.15725612640381

      maxDoc=12949519

      totHits=705758357

      BEST M hits/sec: 35.54591930673003

      BEST QPS: 11.332252412866938

       

      LatLonShape:

      30.32712009 sec to index part 0

      INDEX SIZE: 0.5627057952806354 GB

      READER MB: 0.29498958587646484

      maxDoc=12949519

      totHits=705758228

      BEST M hits/sec: 3.4130465326433357

      BEST QPS: 1.0880999177593018
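
      The three runs above trade indexing cost against search speed in opposite directions. As a rough sketch, the ratios relative to LatLonShape can be computed as follows; the tuple layout and the `relative_to_shape` helper are assumptions made for this example, and only the numbers are taken from the output above.

```python
# (seconds to index part 0, index size in GB, best QPS), copied from the runs above.
results = {
    "s2 (Geo3d)": (1191.732124301, 3.2086284114047885, 4.243743434150063),
    "quad (JTS)": (3252.62925159, 4.5238002222031355, 11.332252412866938),
    "LatLonShape": (30.32712009, 0.5627057952806354, 1.0880999177593018),
}


def relative_to_shape(name):
    """Return (index time, index size, QPS) of `name` divided by the LatLonShape values."""
    base = results["LatLonShape"]
    return tuple(value / ref for value, ref in zip(results[name], base))


relative = {name: relative_to_shape(name) for name in results}

for name, (time_x, size_x, qps_x) in relative.items():
    print(
        f"{name:12s} index time {time_x:7.2f}x  "
        f"index size {size_x:5.2f}x  QPS {qps_x:5.2f}x"
    )
```

      Read this way, the spatial trees take tens to roughly a hundred times longer to index and produce indexes several times larger than LatLonShape, while answering bounding-box queries several times faster.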

       

      Attachments

        1. BKDperf.pdf
          52 kB
          Ignacio Vera
        2. Lake.png
          70 kB
          Nick Knize
        3. Park.png
          121 kB
          Nick Knize
        4. River.png
          91 kB
          Nick Knize

          People

            Assignee: Unassigned
            Reporter: Ignacio Vera (ivera)
            Votes: 0
            Watchers: 6
