CarbonData / CARBONDATA-2632

BloomFilter DataMap Bugs and Optimization

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.1
    • Component/s: None
    • Labels: None

    Description

      This is an umbrella Jira for BloomFilter DataMap bugs and optimizations.

      Sub-Tasks

        1. Bugs are found when the bloom index column is a dictionary/sort/date column (Sub-task, Resolved, Chuanyin Xu; Time Spent: 7h 40m)
        2. Provide more information about the datamap when showing datamaps (Sub-task, Resolved, Chuanyin Xu; Time Spent: 3h 40m)
        3. Support different provider based index datamaps on same column (Sub-task, Resolved, Chuanyin Xu; Time Spent: 2h 40m)
        4. Fix bugs for deferred rebuild for bloomfilter datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 7.5h)
        5. Optimize blocklet pruning for bloomfilter (Sub-task, Open, Unassigned)
        6. Explain query shows negative skipped blocklets for bloomfilter datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 7h 40m)
        7. Pipelined blocklet pruning for index datamaps (Sub-task, Open, Unassigned)
        8. Fix bugs in incorrect blocklet number in bloomfilter (Sub-task, Resolved, Chuanyin Xu; Time Spent: 2h 50m)
        9. Optimize output for explaining query with datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 6.5h)
        10. Support `in` operator for bloomfilter datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 3.5h)
        11. Loading/Filtering empty value fails on bloom index columns (Sub-task, Resolved, Chuanyin Xu; Time Spent: 7h 50m)
        12. Support filtering on longstring bloom index columns (Sub-task, Resolved, Chuanyin Xu; Time Spent: 5h 40m)
        13. Make datamap rebuild for all segments in parallel (Sub-task, Resolved, Chuanyin Xu; Time Spent: 6h 20m)
        14. Update document for bloomfilter (Sub-task, Resolved, Chuanyin Xu; Time Spent: 1h)
        15. Fix bug where alter rename renames the existing table on which a bloomfilter datamap exists (Sub-task, Resolved, wangsen; Time Spent: 6h 40m)
        16. Block alter datatype of bloom index columns (Sub-task, Resolved, lianganping; Time Spent: 9h 20m)
        17. Whether to support altering name for bloom index columns (Sub-task, Resolved, Unassigned)
        18. Block dropping index columns for index datamap (Sub-task, Resolved, lianganping; Time Spent: 1.5h)
        19. Fix bugs in clear bloom datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 2.5h)
        20. Clear bloom index file after segment is deleted (Sub-task, Resolved, Chuanyin Xu; Time Spent: 2h 40m)
        21. Clear index file if data loading fails (Sub-task, Resolved, Chuanyin Xu; Time Spent: 5h 20m)
        22. Refactor and move ReadSupport in IndexDataMapRebuildRDD to CarbonCore (Sub-task, Open, Unassigned)
        23. Add validation for datamap writer while loading data (Sub-task, Resolved, Chuanyin Xu; Time Spent: 2h 40m)
        24. Table update/delete needs to be blocked on tables having datamaps (Sub-task, Resolved, wangsen; Time Spent: 2h 50m)
        25. Failed to recreate the table which has bloomfilter on it with same table name but different bloom index (Sub-task, Resolved, Chuanyin Xu; Time Spent: 3.5h)
        26. Support create bloom datamap on newly added column (Sub-task, Resolved, Manhua Jiang; Time Spent: 6h 50m)
        27. Block creating bloomfilter datamap index on local_dictionary column (Sub-task, Closed, lianganping)
        28. Block creating bloomfilter datamap index on columns whose datatype is a complex type (Sub-task, Resolved, lianganping)
        29. Fix bug for getting datamap file when table has multiple datamaps (Sub-task, Resolved, Manhua Jiang; Time Spent: 4.5h)
        30. Fix bug when building bloomfilter on measure column (Sub-task, Resolved, Manhua Jiang; Time Spent: 3h)
        31. Optimize code to get blocklet id when rebuilding datamap (Sub-task, Resolved, Manhua Jiang; Time Spent: 2h)
        32. Exception should be thrown if expression does not satisfy bloomFilter's requirement (Sub-task, Resolved, Manhua Jiang; Time Spent: 1h)
        33. Update document of bloom filter datamap (Sub-task, Resolved, Manhua Jiang; Time Spent: 50m)
        34. Optimize default parameter for bloomfilter datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 4.5h)
        35. Fix bugs in incorrect query result with bloom datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 13h 20m)
        36. Add useful tips for bloomfilter datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 3h)
        37. Add query test case using search mode on table with bloom filter (Sub-task, Resolved, Manhua Jiang; Time Spent: 4.5h)
        38. Merge bloom index files of multi-shards for each index column (Sub-task, Resolved, Manhua Jiang; Time Spent: 10h)
        39. Fix bug in bloom index on multiple dictionary columns (Sub-task, Resolved, Chuanyin Xu; Time Spent: 5h 20m)
        40. Add sdv test case for bloomfilter datamap (Sub-task, Resolved, Chuanyin Xu; Time Spent: 1h 50m)

          People

            Assignee: Chuanyin Xu
            Reporter: Chuanyin Xu
            Votes: 0
            Watchers: 1

              Time Tracking

                Estimated: Not Specified
                Remaining: 0h
                Logged: 160h 50m