Accumulo / ACCUMULO-4314

Use statistics to choose better keys for RFile index

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.6.6, 1.7.2, 1.8.0
    • Component/s: None
    • Labels: None

    Description

      The commit for ACCUMULO-1124 makes two changes:

      • Generates shorter keys, which may not exist in the data, to place in the RFile index.
      • Uses statistics to make better choices about which keys to place in the index. It prefers keys of average size or smaller and excludes large keys (keys more than 3 standard deviations above the average).

      The change to generate shorter keys cannot be made in 1.7.X and 1.6.X because it would produce RFiles that may not work properly with older 1.6 and 1.7 versions. However, the change to use statistics to pick better keys could be made in 1.6 and 1.7.
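
      As a rough illustration of the statistics-based selection described above, the sketch below keeps a running mean and standard deviation of key sizes and classifies a candidate key against them. The class `KeySizeStats` and its method names are hypothetical and are not taken from the ACCUMULO-1124 commit; the use of a population standard deviation and the exact comparisons are assumptions.

```java
/**
 * Hypothetical helper (not Accumulo's actual code) that tracks key sizes
 * seen so far and classifies candidate index keys by size.
 */
final class KeySizeStats {
    private long count = 0;
    private double mean = 0.0;
    private double m2 = 0.0; // running sum of squared deviations from the mean

    /** Records the size of one key using Welford's online update. */
    void add(int keySize) {
        count++;
        double delta = keySize - mean;
        mean += delta / count;
        m2 += delta * (keySize - mean);
    }

    double mean() {
        return mean;
    }

    /** Population standard deviation of the sizes recorded so far. */
    double stdDev() {
        return count > 1 ? Math.sqrt(m2 / count) : 0.0;
    }

    /** Assumed preference rule: keys of average size or smaller are good index keys. */
    boolean isPreferred(int keySize) {
        return count == 0 || keySize <= mean;
    }

    /** Assumed exclusion rule: keys more than 3 standard deviations above the mean. */
    boolean isTooLarge(int keySize) {
        return count > 1 && keySize > mean + 3 * stdDev();
    }
}
```

      A writer could call `add` for every key it appends, then consult `isPreferred` and `isTooLarge` when a block is about to close to decide whether the current key is a reasonable index entry or whether it should wait for a smaller one.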

            People

              Assignee: Keith Turner (kturner)
              Reporter: Keith Turner (kturner)