Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-3758

Optimize flink partition table with BucketIndex

    XMLWordPrintableJSON

Details

    Description

      When using flink bucket index , I meet two problems

      • without use all streamWriter tasks when partition table with small Bucket number
      • crashed with the following step
      1. start job
      2. killed before first commit success ( left some log files)
      3. restart job run nomal after one successful commit
      4. kill job and restart  throws `Duplicate fileID 00000001-6f57-4c71-bf6f-ee7616ec7b14 from bucket 1 of partition  found during the BucketStreamWriteFunction index bootstrap`

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              wukong konwu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: