Uploaded image for project: 'Phoenix'
  1. Phoenix
  2. PHOENIX-5494

Batched, mutable Index updates are unnecessarily run one-by-one

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 4.15.0, 5.1.0
    • 4.15.0, 5.1.0
    • None

    Description

      I just noticed that index updates on mutable tables retrieve their deletes (to invalidate the old index entry) one-by-one.

      For batches, this can be the major time spent during an index update. The cost is mostly incured by the repeated setup (and seeking) of the new region scanner (for each row).

      We can instead do a skip scan and get all updates in a single scan per region.

      (Logically that is simple, but it will require some refactoring)

      I won't be getting to this, but recording it here in case someone feels inclined.

      Attachments

        1. PHOENIX-5494_v9-master.patch
          43 kB
          chenglei
        2. PHOENIX-5494_v9-4.x-HBase-1.4.patch
          42 kB
          chenglei
        3. PHOENIX-5494-4.x-HBase-1.4.patch
          42 kB
          chenglei
        4. 5494-4.x-HBase-1.5.txt
          14 kB
          Lars Hofhansl
        5. PHOENIX-5494.master.003.patch
          16 kB
          Kadir OZDEMIR
        6. PHOENIX-5494.master.002.patch
          15 kB
          Kadir OZDEMIR
        7. PHOENIX-5494.master.001.patch
          15 kB
          Kadir OZDEMIR
        8. Screenshot_20191110_161453.png
          62 kB
          Lars Hofhansl
        9. Screenshot_20191110_160351.png
          46 kB
          Lars Hofhansl
        10. Screenshot_20191110_160243.png
          34 kB
          Lars Hofhansl

        Activity

          People

            comnetwork chenglei
            larsh Lars Hofhansl
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 5h 50m
                5h 50m