CarbonData / CARBONDATA-3347

Support SORT_COLUMNS modification


Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Component/s: spark-integration

    Description

      Background

      Currently, SORT_COLUMNS cannot be modified after a table is created. To change SORT_COLUMNS on an existing table, we have to create a new table and migrate the data. If the data is huge, the migration takes a long time and can even impact the user's business.

      SORT_SCOPE in the table properties can already be modified, and a new SORT_SCOPE can be specified during data loading. The carbon index file marks whether a segment is sorted or not, so different segments may have different SORT_SCOPE values.
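
      As a rough illustration of the two places SORT_SCOPE can be set, the Python sketch below only builds the SQL strings. The table name `sales`, the input path, and both helper names are hypothetical; the statements are assumed to follow CarbonData's `ALTER TABLE ... SET TBLPROPERTIES` and `LOAD DATA ... OPTIONS` forms.

```python
def alter_sort_scope(table: str, sort_scope: str) -> str:
    """Build a statement that changes the table-level SORT_SCOPE property."""
    return f"ALTER TABLE {table} SET TBLPROPERTIES('SORT_SCOPE'='{sort_scope}')"


def load_with_sort_scope(table: str, path: str, sort_scope: str) -> str:
    """Build a load statement that sets SORT_SCOPE for this load only."""
    return (
        f"LOAD DATA INPATH '{path}' INTO TABLE {table} "
        f"OPTIONS('SORT_SCOPE'='{sort_scope}')"
    )


# Hypothetical table name and path, for illustration only.
print(alter_sort_scope("sales", "LOCAL_SORT"))
print(load_with_sort_scope("sales", "hdfs://path/to/data.csv", "NO_SORT"))
```

      Because the load-level option applies to a single load, two segments of the same table can end up with different sort scopes.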

      Motivation

      After the table is created, the user can adjust SORT_SCOPE/SORT_COLUMNS according to their business needs. Historical segments will still use the old SORT_SCOPE/SORT_COLUMNS, but the user can also re-sort old segments one by one if needed.

      However, we still suggest that the user choose a proper SORT_SCOPE/SORT_COLUMNS when creating the table, because a later modification takes significant resources to re-sort the data of old segments.
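
      The intended behavior can be pictured with a small toy model. This is only a sketch under stated assumptions, not CarbonData code: the `Table` and `Segment` classes, their method names, and the use of column positions as sort keys are all hypothetical. It shows that changing the sort columns touches only table metadata, that new loads pick up the new setting, and that old segments keep their original order until they are re-sorted individually.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Segment:
    segment_id: int
    rows: List[Tuple]
    sort_columns: Tuple[int, ...]  # column positions this segment is sorted by


@dataclass
class Table:
    sort_columns: Tuple[int, ...]
    segments: List[Segment] = field(default_factory=list)

    def load(self, rows):
        """New loads are sorted with the table's current sort columns."""
        cols = self.sort_columns
        ordered = sorted(rows, key=lambda r: tuple(r[c] for c in cols))
        self.segments.append(Segment(len(self.segments), ordered, cols))

    def alter_sort_columns(self, cols):
        """Only table metadata changes; existing segments are untouched."""
        self.sort_columns = tuple(cols)

    def resort_segment(self, segment_id):
        """Re-sort one old segment with the current sort columns."""
        seg = self.segments[segment_id]
        cols = self.sort_columns
        seg.rows = sorted(seg.rows, key=lambda r: tuple(r[c] for c in cols))
        seg.sort_columns = cols


table = Table(sort_columns=(0,))
table.load([(2, "b"), (1, "c")])   # segment 0, sorted by column 0
table.alter_sort_columns((1,))     # metadata only, no data rewritten
table.load([(9, "z"), (8, "a")])   # segment 1, sorted by column 1
table.resort_segment(0)            # rewrites the old segment's data
```

      In this model the cost of a modification sits entirely in `resort_segment`, which rewrites a whole segment. That mirrors the advice above to pick suitable settings when the table is created.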

       

      Please check the design doc for more details.

      sort_columns modification_v2.pdf

      Attachments

        1. sort_columns modification_v2.pdf
          34 kB
          David Cai
        2. sort_columns modification.pdf
          34 kB
          David Cai

        Activity

          People

            Assignee: David Cai (qiangcai)
            Reporter: David Cai (qiangcai)
            Votes: 0
            Watchers: 1


              Time Tracking

                Estimated: Not Specified
                Remaining: 0h
                Logged: 27h