Apache AsterixDB / ASTERIXDB-2252

Improve scan efficiency of LSM components


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: STO - Storage

    Description

      The current full scan of LSM components is inefficient, especially on hard disks, in two respects:

      1. A scan often uses a priority queue to merge results from multiple components. However, each component is read only one page at a time, so consecutive reads alternate between components and incur heavy random I/O on hard disks.
      2. A full scan can fill up the buffer cache and evict the pages that were cached before it. The problem is most notable during a merge: after the merge completes, the buffer cache is filled with pages of the old components, which future queries will never access.
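
      To make the first point concrete, here is a minimal sketch of a priority-queue merge over sorted components, with a configurable read size. It is not AsterixDB code: the names `BatchedLsmScan`, `ComponentCursor`, `pagesPerRead`, and `ioRequests` are hypothetical, pages are simulated as in-memory arrays, and reconciling duplicate keys across components is ignored. It only counts how many separate read requests each cursor issues.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.PriorityQueue;

public class BatchedLsmScan {

    /** Cursor over one sorted component; refills its buffer pagesPerRead pages at a time. */
    static final class ComponentCursor {
        final int[][] pages;      // simulated on-disk pages, keys sorted across pages
        final int pagesPerRead;   // hypothetical knob: pages fetched per read request
        int nextPage = 0;
        int[] buffer = new int[0];
        int pos = 0;
        int ioRequests = 0;       // number of separate read requests issued

        ComponentCursor(int[][] pages, int pagesPerRead) {
            this.pages = pages;
            this.pagesPerRead = Math.max(1, pagesPerRead);
        }

        boolean hasNext() {
            // Refill only when the buffer is exhausted and pages remain.
            while (pos == buffer.length && nextPage < pages.length) {
                refill();
            }
            return pos < buffer.length;
        }

        private void refill() {
            int end = Math.min(nextPage + pagesPerRead, pages.length);
            int total = 0;
            for (int p = nextPage; p < end; p++) {
                total += pages[p].length;
            }
            int[] batch = new int[total];
            int off = 0;
            for (int p = nextPage; p < end; p++) {
                System.arraycopy(pages[p], 0, batch, off, pages[p].length);
                off += pages[p].length;
            }
            buffer = batch;
            pos = 0;
            nextPage = end;
            ioRequests++;         // one request covers the whole contiguous batch
        }

        int peek() { return buffer[pos]; }

        void advance() { pos++; }
    }

    /** K-way merge: the heap is ordered by each cursor's current key. */
    static List<Integer> mergeScan(List<ComponentCursor> cursors) {
        PriorityQueue<ComponentCursor> heap =
                new PriorityQueue<>((a, b) -> Integer.compare(a.peek(), b.peek()));
        for (ComponentCursor c : cursors) {
            if (c.hasNext()) heap.add(c);
        }
        List<Integer> out = new ArrayList<>();
        while (!heap.isEmpty()) {
            ComponentCursor c = heap.poll();
            out.add(c.peek());
            c.advance();
            if (c.hasNext()) heap.add(c);   // re-insert with its new current key
        }
        return out;
    }

    static int totalRequests(int pagesPerRead) {
        int[][] a = {{1, 4}, {7, 10}, {13, 16}, {19, 22}};
        int[][] b = {{2, 5}, {8, 11}, {14, 17}, {20, 23}};
        ComponentCursor c1 = new ComponentCursor(a, pagesPerRead);
        ComponentCursor c2 = new ComponentCursor(b, pagesPerRead);
        mergeScan(Arrays.asList(c1, c2));
        return c1.ioRequests + c2.ioRequests;
    }

    public static void main(String[] args) {
        System.out.println("page at a time: " + totalRequests(1) + " requests");
        System.out.println("four pages per read: " + totalRequests(4) + " requests");
    }
}
```

      With two components of four pages each, reading one page at a time issues 8 requests that interleave between the components, while reading four pages per request issues 2. On a hard disk each switch between components is likely to cost a seek, so larger sequential reads should reduce the random I/O described in point 1. The second point (cache pollution) is not addressed by this sketch.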


          People

            Assignee: Chen Luo (luochen01)
            Reporter: Chen Luo (luochen01)