  1. Accumulo
  2. ACCUMULO-4066

Conditional mutation processing performance could be improved.

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.6.4, 1.7.0
    • Fix Version/s: 1.6.5, 1.7.1, 1.8.0
    • Component/s: tserver
    • Labels:
      None

      Description

      Processing conditional mutations requires tablet reads. The way the current implementation does these reads has a lot of overhead. For each condition, the following is done:

      • Opens and reserves iterators files.
      • Parses table iterators from the table config (involves scanning and filtering the entire table config).
      • Merges condition iterators and table iterators.
      • Constructs iterator stack.

      I created a branch where these operations (except for constructing the iterator stack) are done per tablet and/or per batch of conditional mutations instead of per condition. With this change I am seeing a 3x speedup in conditional mutation processing rates when the data is cached.
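The idea of moving setup work out of the per-condition path can be sketched as follows. This is only an illustration, not the code from the branch: the class, the method names, the config keys, and the use of plain strings as stand-ins for iterators are all hypothetical. The counter exists only to show how often the expensive config parse runs under each strategy.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class ConditionalBatchSketch {
    // Counts how many times the (hypothetical) expensive config parse runs.
    static int configParses = 0;

    // Stand-in for scanning and filtering the entire table config for iterator settings.
    static List<String> parseTableIterators(Map<String, String> tableConfig) {
        configParses++;
        List<String> iters = new ArrayList<>();
        for (Map.Entry<String, String> e : tableConfig.entrySet()) {
            if (e.getKey().startsWith("table.iterator.")) {
                iters.add(e.getValue());
            }
        }
        return iters;
    }

    // Still done for every condition: combine table and condition iterators into a stack.
    static List<String> buildStack(List<String> tableIters, List<String> conditionIters) {
        List<String> stack = new ArrayList<>(tableIters);
        stack.addAll(conditionIters);
        return stack;
    }

    // Old shape: the config is parsed again for each condition.
    static List<List<String>> perCondition(Map<String, String> cfg, List<List<String>> conditions) {
        List<List<String>> stacks = new ArrayList<>();
        for (List<String> conditionIters : conditions) {
            stacks.add(buildStack(parseTableIterators(cfg), conditionIters));
        }
        return stacks;
    }

    // New shape: the config is parsed once per batch and the result is reused.
    static List<List<String>> perBatch(Map<String, String> cfg, List<List<String>> conditions) {
        List<String> tableIters = parseTableIterators(cfg);
        List<List<String>> stacks = new ArrayList<>();
        for (List<String> conditionIters : conditions) {
            stacks.add(buildStack(tableIters, conditionIters));
        }
        return stacks;
    }

    public static void main(String[] args) {
        Map<String, String> cfg = new TreeMap<>();
        cfg.put("table.iterator.scan.vers", "VersioningIterator"); // illustrative entry
        cfg.put("table.split.threshold", "1G");                    // filtered out by the parse

        List<List<String>> conditions = new ArrayList<>();
        conditions.add(Arrays.asList("condIterA"));
        conditions.add(Arrays.asList("condIterB"));
        conditions.add(new ArrayList<String>());

        configParses = 0;
        List<List<String>> before = perCondition(cfg, conditions);
        int parsesBefore = configParses;

        configParses = 0;
        List<List<String>> after = perBatch(cfg, conditions);
        int parsesAfter = configParses;

        System.out.println("parses per condition strategy: " + parsesBefore);
        System.out.println("parses per batch strategy: " + parsesAfter);
        System.out.println("same stacks: " + before.equals(after));
    }
}
```

Both strategies build the same stacks; the per-batch version simply pays the config parse once for the whole batch rather than once per condition. Opening and reserving files could be hoisted the same way, but that is not modeled here.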

              People

              • Assignee: Keith Turner (kturner)
              • Reporter: Keith Turner (kturner)
              • Votes: 0
              • Watchers: 2

                  Time Tracking

                  • Estimated: Not Specified
                  • Remaining: 0h
                  • Logged: 0.5h