Apache Hudi / HUDI-1214

Need ability to set deltastreamer checkpoints when doing Spark datasource writes


Details

    Description

      Such support is needed for bootstrapping cases where users perform the initial bootstrap with a Spark datasource write and subsequently switch to DeltaStreamer.

      DeltaStreamer manages checkpoints inside hoodie commit files and expects to find a checkpoint in previously committed metadata. Users are expected to pass a checkpoint or an initial checkpoint provider when performing a bootstrap through DeltaStreamer. No equivalent support exists when the bootstrap is done through the Spark datasource, so DeltaStreamer has no checkpoint to resume from afterwards.
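
      To illustrate the gap, here is a minimal sketch of looking up a checkpoint in commit metadata. It assumes the commit file is JSON with an `extraMetadata` map and that the checkpoint lives under the key `deltastreamer.checkpoint.key`; neither detail is stated in this issue, and the helper `read_checkpoint` and the sample values are hypothetical.

```python
import json
from typing import Optional

# Assumption: key under which DeltaStreamer stores its checkpoint.
CHECKPOINT_KEY = "deltastreamer.checkpoint.key"


def read_checkpoint(commit_json: str, key: str = CHECKPOINT_KEY) -> Optional[str]:
    """Return the checkpoint found in a commit's extra metadata, or None."""
    metadata = json.loads(commit_json)
    # Assumption: extra metadata is a flat string-to-string map.
    extra = metadata.get("extraMetadata") or {}
    return extra.get(key)


# Hypothetical commit written by DeltaStreamer: carries a checkpoint.
deltastreamer_commit = json.dumps(
    {"extraMetadata": {CHECKPOINT_KEY: "topic,0:100"}}
)

# Hypothetical commit written by a Spark datasource bootstrap: no checkpoint.
datasource_commit = json.dumps({"extraMetadata": {"schema": "{}"}})
```

      With these samples, `read_checkpoint(deltastreamer_commit)` yields the stored value, while `read_checkpoint(datasource_commit)` yields `None`, which is the situation DeltaStreamer would face after a Spark datasource bootstrap.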


          People

            Trevorzhang Trevorzhang
            vbalaji Balaji Varadarajan
