HUDI-3844

HoodieIndexer should set existing MDT partitions in props if not already set by user

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.11.0
    • Component/s: None

    Description

      Currently, the indexer assumes that only the partitions set by the user (in the props passed to the indexer) are enabled, and it then deletes the other partitions (except FILES) while fetching the metadata writer.

      For instance, suppose the ingestion writer had metadata enabled (and hence the FILES partition) along with the BLOOM_FILTERS index. After a few commits, the metadata table contains the files and bloom_filters partitions, as expected. Now the user wants to build the COLUMN_STATS index with the indexer, and so enables metadata and the column_stats index in the props passed to the indexer. In this scenario, the indexer presumes that only files and column_stats are enabled and that bloom_filters is disabled, so the call to table.getMetadataWriter() concludes that bloom_filters needs to be removed, which is wrong.

      The indexer should not presume which indexes (or MDT partitions) are disabled. Instead, it should update its props based on the table config. If a partition exists because regular writers created it, the indexer should not delete that partition.
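
      The intended behaviour can be sketched roughly as below. This is a standalone illustration, not the actual HoodieIndexer code: the class IndexerPropsSketch, both helper methods, and the partition-to-flag mapping are hypothetical, and the config keys are assumed to mirror Hudi's metadata index flags. The set of existing partitions stands in for whatever the table config records.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Properties;
import java.util.Set;
import java.util.TreeSet;

public class IndexerPropsSketch {

    // Assumed mapping from MDT partition name to the flag that enables it.
    // FILES is deliberately absent: it is never a deletion candidate.
    static final Map<String, String> ENABLE_KEYS = new HashMap<>();
    static {
        ENABLE_KEYS.put("bloom_filters", "hoodie.metadata.index.bloom.filter.enable");
        ENABLE_KEYS.put("column_stats", "hoodie.metadata.index.column.stats.enable");
    }

    // Returns a copy of the user's props in which every partition that already
    // exists in the table is enabled, unless the user set that flag explicitly.
    static Properties withExistingPartitionsEnabled(Properties userProps, Set<String> existing) {
        Properties merged = new Properties();
        merged.putAll(userProps);
        for (String partition : existing) {
            String key = ENABLE_KEYS.get(partition);
            if (key != null && merged.getProperty(key) == null) {
                merged.setProperty(key, "true");
            }
        }
        return merged;
    }

    // Models the presumption described above: an existing partition whose
    // flag is not enabled in the props is treated as one to delete.
    static Set<String> partitionsToDelete(Properties props, Set<String> existing) {
        Set<String> toDelete = new TreeSet<>();
        for (String partition : existing) {
            String key = ENABLE_KEYS.get(partition);
            if (key != null && !Boolean.parseBoolean(props.getProperty(key, "false"))) {
                toDelete.add(partition);
            }
        }
        return toDelete;
    }

    public static void main(String[] args) {
        // Props as passed to the indexer: metadata plus column_stats only.
        Properties userProps = new Properties();
        userProps.setProperty("hoodie.metadata.enable", "true");
        userProps.setProperty("hoodie.metadata.index.column.stats.enable", "true");

        // Partitions already created by the regular ingestion writer.
        Set<String> existing = new HashSet<>(Arrays.asList("files", "bloom_filters"));

        System.out.println("before: " + partitionsToDelete(userProps, existing));
        Properties merged = withExistingPartitionsEnabled(userProps, existing);
        System.out.println("after:  " + partitionsToDelete(merged, existing));
    }
}
```

      With the user's props alone, bloom_filters is a deletion candidate; after merging in the existing partitions, nothing is. A flag the user set explicitly, including an explicit "false", is left untouched, which matches the "if not already set by user" wording in the title.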

            People

              codope Sagar Sumit
              codope Sagar Sumit
              sivabalan narayanan