Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-7237

Minor Improvements to Schema Handling in Delta Sync

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 0.15.0, 1.0.0
    • None

    Description

      There are a two minor items that we have run into running DeltaStreamer in production.
      1. The number of times the schema is fetched is more than it needs to be and can put unnecessary load on schema providers or increase file system reads

      2. SchemaProviders that return null target schemas on empty batches cause null schema values in commits leading to unexpected issues later

       

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tim.brown Timothy Brown
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: