CarbonData / CARBONDATA-1516

Support pre-aggregate tables and timeseries in carbondata

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.0
    • Component/s: None
    • Labels: None

    Description

      Currently, CarbonData offers standard SQL capability on distributed data sets. CarbonData should also support pre-aggregate tables for timeseries data to improve query performance.
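
The idea behind a pre-aggregate table can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not CarbonData code: the row layout, `MAIN_TABLE`, and both function names are hypothetical. Rows are rolled up once into hourly totals, and a coarser day-level group-by query is then answered from the rollup instead of rescanning the raw rows.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical main-table rows: (event_time, country, amount).
MAIN_TABLE = [
    (datetime(2017, 9, 1, 10, 5), "IN", 10),
    (datetime(2017, 9, 1, 10, 40), "IN", 5),
    (datetime(2017, 9, 1, 11, 15), "CN", 7),
    (datetime(2017, 9, 2, 9, 0), "IN", 3),
]


def build_hourly_preaggregate(rows):
    """Roll rows up to (hour, country) -> [sum, count], computed once at load time."""
    agg = defaultdict(lambda: [0, 0])
    for ts, country, amount in rows:
        # Truncate the timestamp to the start of its hour.
        hour = ts.replace(minute=0, second=0, microsecond=0)
        cell = agg[(hour, country)]
        cell[0] += amount
        cell[1] += 1
    return dict(agg)


def daily_sum_by_country(preagg):
    """Answer a day-level group-by from the hourly rollup, not the raw rows."""
    out = defaultdict(int)
    for (hour, country), (total, _count) in preagg.items():
        out[(hour.date(), country)] += total
    return dict(out)


HOURLY = build_hourly_preaggregate(MAIN_TABLE)
DAILY = daily_sum_by_country(HOURLY)
```

Keeping both sum and count in the rollup means averages can also be derived at any coarser granularity without going back to the main table.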

      Attachments

        1. CarbonData Pre-aggregation Table_v1.1.pdf
          48 kB
          Jacky Li
        2. CarbonData Pre-aggregation Table_v1.2.pdf
          121 kB
          Kumar Vishal
        3. CarbonData Pre-aggregation Table_v1.3.pdf
          135 kB
          Liang Chen
        4. CarbonData Pre-aggregation Table.pdf
          44 kB
          Ravindra Pesala

        Issue Links

          1. Support CTAS in CarbonData and support creating aggregation tables using CTAS. Update the main table schema with the aggregation table information. (Sub-task, Resolved, Kumar Vishal)

          100%

          Original Estimate - Not Specified
          Time Spent - 4h 10m
          2. Support creating timeseries while creating the main table. (Sub-task, Resolved, Kumar Vishal)

          100%

          Original Estimate - Not Specified
          Time Spent - 7h 50m
          3. Create a UDF to extract year, month, day, hour, and minute from timestamp and date values. (Sub-task, Resolved, Kumar Vishal)

          100%

          Original Estimate - Not Specified
          Time Spent - 12h 20m
          4. Load aggregation tables from the main table after the load finishes. (Sub-task, Resolved, Kunal Kapoor)

          100%

          Original Estimate - Not Specified
          Time Spent - 9h 40m
          5. Add SQL syntax for loading aggregation tables offline. (Sub-task, Open, Unassigned)
          6. Loading aggregation tables for streaming data tables. (Sub-task, Resolved, Kunal Kapoor)

          100%

          Original Estimate - Not Specified
          Time Spent - 9h 40m
          7. Add an API in the carbon layer to get a suitable aggregation table for a group-by query. Update the query plan in the carbon optimizer to support aggregation tables for group-by queries. (Sub-task, Resolved, Kumar Vishal)

          100%

          Original Estimate - Not Specified
          Time Spent - 9h 30m
          8. Refresh the cache of the main table after dropping an aggregation table. (Sub-task, Resolved, Kunal Kapoor)

          100%

          Original Estimate - Not Specified
          Time Spent - 10h
          9. Drop all related aggregation tables when the main table is deleted. (Sub-task, Open, Kunal Kapoor)
          10. Handle compaction in aggregation tables. (Sub-task, Resolved, Kunal Kapoor)

          100%

          Original Estimate - Not Specified
          Time Spent - 13h 10m
          11. Don't allow update/delete and delete segment operations on a main table that has aggregation tables. (Sub-task, Open, Kunal Kapoor)

          100%

          Original Estimate - Not Specified
          Time Spent - 4h 50m
          12. Handle alter table scenarios for aggregation tables. (Sub-task, Resolved, Kunal Kapoor)

          100%

          Original Estimate - Not Specified
          Time Spent - 6h
          13. Support describe for aggregation tables. (Sub-task, Open, Unassigned)
          14. Support desc table and desc formatted table for pre-aggregate tables. (Sub-task, Open, Unassigned)
          15. Update thrift to support pre-aggregate tables. (Sub-task, Open, Unassigned)

          100%

          Original Estimate - Not Specified
          Time Spent - 2h 10m
          16. Support expressions inside aggregate expressions in create and load data on pre-aggregate tables. (Sub-task, Resolved, Kumar Vishal)

          100%

          Original Estimate - Not Specified
          Time Spent - 6h 30m
          17. Support expressions inside aggregate expressions during queries on pre-aggregate tables. (Sub-task, Resolved, Kumar Vishal)

          100%

          Original Estimate - Not Specified
          Time Spent - 12h
          18. Support subqueries on pre-aggregate tables. (Sub-task, Resolved, Kumar Vishal)
          19. Support queries on pre-aggregate tables of streaming tables. (Sub-task, Open, Kumar Vishal)

          100%

          Original Estimate - Not Specified
          Time Spent - 10h
          20. Support pre-aggregate table creation on partition tables. (Sub-task, Resolved, Kunal Kapoor)

          100%

          Original Estimate - Not Specified
          Time Spent - 17h 20m

            People

              kumarvishal09 Kumar Vishal
              ravi.pesala Ravindra Pesala
              Votes: 0
              Watchers: 5

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 135h 10m