Uploaded image for project: 'Kudu'
  1. Kudu
  2. KUDU-1967 Umbrella JIRA for node density improvements
  3. KUDU-1971

Explore reducing number of data blocks by tuning existing parameters

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.4.0
    • None
    • tablet

    Description

      One way to scale to larger on-disk data sets is to reduce the ratio between data blocks and data; that is, to make data blocks larger. Two existing parameters control for this:

      • budgeted_compaction_target_rowset_size: within a given flush or compaction operation, stipulates the size of each rowset. Currently 32M.
      • tablet_compaction_budget_mb: stipulates the amount of data that should be included in any given compaction. Currently 128M.

      It might be interesting to explore raising these.

      Attachments

        Activity

          People

            Unassigned Unassigned
            adar Adar Dembo
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: