Uploaded image for project: 'Apache Hudi'
  1. Apache Hudi
  2. HUDI-4855

Bootstrap table from Deltastreamer cannot be read in Spark

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

       

      scala> val df = spark.read.format("hudi").load("<bootstrap_table>")
      org.apache.hudi.exception.HoodieException: No files found for reading in user provided path.
        at org.apache.hudi.HoodieBootstrapRelation.buildFileIndex(HoodieBootstrapRelation.scala:167)
        at org.apache.hudi.HoodieBootstrapRelation.<init>(HoodieBootstrapRelation.scala:65)
        at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:144)
        at org.apache.hudi.DefaultSource.createRelation(DefaultSource.scala:68)
        at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:350)
        at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:274)
        at org.apache.spark.sql.DataFrameReader.$anonfun$load$3(DataFrameReader.scala:245)
        at scala.Option.getOrElse(Option.scala:189)
        at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:245)
        at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:188)
        ... 47 elided
      
      

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            codope Sagar Sumit
            guoyihua Ethan Guo
            Sagar Sumit, Shiyan Xu
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Agile

                Completed Sprints:
                2022/09/05 ended 19/Sep/22
                2022/09/19 ended 04/Oct/22
                2022/10/04 ended 19/Oct/22
                View on Board

                Slack

                  Issue deployment