Apache Hudi / HUDI-1295

Implement: Metadata-based bloom index - write path


Details

    Description

      The idea here is to maintain our bloom filters outside of the parquet files, for speedier access from the bloom index.

       

      • Design and implement the migration of bloom filters to the metadata table.

      Design:

      Schema for the payload:

      key: partitionName_fileName

      payload schema:

      isDeleted (boolean): true/false

      bloom_type (short)

      ser_bloom (byte[]): the serialized bloom filter
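As an illustration of the schema above, here is a minimal Java sketch of the key and the three payload fields. All class and method names (`BloomFilterMetadataSketch`, `Payload`, `key`) are hypothetical and are not Hudi's actual classes. The reading of `isDeleted` as a marker for a removed entry is also an assumption, since the description only lists the field.

```java
import java.util.Arrays;

// Hypothetical names throughout: an illustration of the proposed schema,
// not the classes used in the Hudi code base.
final class BloomFilterMetadataSketch {

    /** Value stored per data file, mirroring the three payload fields. */
    static final class Payload {
        final boolean isDeleted; // assumed to mark an entry that has been removed
        final short bloomType;   // bloom_type: code identifying the kind of filter
        final byte[] serBloom;   // ser_bloom: serialized bloom filter bytes

        Payload(boolean isDeleted, short bloomType, byte[] serBloom) {
            this.isDeleted = isDeleted;
            this.bloomType = bloomType;
            // Defensive copy so later changes to the caller's array do not leak in.
            this.serBloom = Arrays.copyOf(serBloom, serBloom.length);
        }
    }

    /** Builds the record key in the form partitionName_fileName. */
    static String key(String partitionName, String fileName) {
        return partitionName + "_" + fileName;
    }
}
```

With this layout, a writer would build one such key and payload per data file, so a reader can fetch a file's filter by key without opening the parquet file itself.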

       

       

       

            People

              manojg Manoj Govindassamy
              vinoth Vinoth Chandar