Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-4615

Fix empty commits being made by deltastreamer with S3EventsSource when there is no data in SQS on starting a new pipeline

    XMLWordPrintableJSON

Details

    • 0.5

    Description

      When we start a new deltastreamer with S3EventsSource, checkpoint is Option.empty(). After consumption from source, if there is no data, the source returns "val=0" as the checkpoint. So, deltastreamer assumes checkpoint has changed and makes an empty commit. This needs fixing. 

       

      https://github.com/apache/hudi/blob/0d0a4152cfd362185066519ae926ac4513c7a152/hudi-utilities/src/main/java/org/apache/hudi/utilities/sources/helpers/S3EventsMetaSelector.java#L151

       

      Attachments

        Issue Links

          Activity

            People

              vinish_jail97 Vinish Reddy
              shivnarayan sivabalan narayanan
              sivabalan narayanan
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: