Zeppelin / ZEPPELIN-5887

Classloader issue when reading Parquet files.

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.10.0
    • Fix Version/s: None
    • Component/s: flink
    • Labels: None

    Description

      I am trying to read a Parquet file into a table in Zeppelin, but it fails with `java.lang.ClassNotFoundException: org.apache.hadoop.conf.Configuration`.

      Docker setup/Steps to reproduce:

      docker run -u $(id -u) -p 8080:8080 -p 8081:8081 --rm -v /Users/xxx/Downloads/flink-1.13.6:/opt/flink -v /Users/xxx/Downloads/file.parquet:/opt/flink/data.parquet -e FLINK_HOME=/opt/flink --name zeppelin apache/zeppelin:0.10.0

      `flink-sql-parquet_2.12-1.13.6.jar` is added to `/Users/xxx/Downloads/flink-1.13.6`.

      I attempted to supply the missing dependency by adding `flink-s3-fs-hadoop-1.13.6.jar` to `/Users/xxx/Downloads/flink-1.13.6`. This gets past the first error but uncovers another missing class: `java.lang.ClassNotFoundException: org.apache.hadoop.mapreduce.lib.input.FileInputFormat`.
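
To narrow down which jars (if any) under the mounted Flink directory actually contain the classes named in the two exceptions, a small diagnostic along these lines may help. This is only a sketch, not a fix: the helper names `class_to_entry` and `find_class_in_jars` are made up for illustration, and it assumes Info-ZIP `unzip` (for the `-Z1` listing mode) is available on the host.

```shell
#!/bin/sh
# Hypothetical helpers (not part of Flink or Zeppelin).

# Convert a fully qualified class name to its path inside a jar,
# e.g. a.b.C -> a/b/C.class
class_to_entry() {
  printf '%s.class\n' "$(printf '%s' "$1" | tr '.' '/')"
}

# Print every jar under directory $2 (searched recursively) that contains
# class $1 at its unshaded path. Assumes Info-ZIP unzip, whose -Z1 mode
# lists archive entry names one per line.
find_class_in_jars() {
  entry="$(class_to_entry "$1")"
  find "$2" -type f -name '*.jar' | while IFS= read -r jar; do
    if unzip -Z1 "$jar" 2>/dev/null | grep -qxF "$entry"; then
      printf '%s\n' "$jar"
    fi
  done
}
```

For example, `find_class_in_jars org.apache.hadoop.mapreduce.lib.input.FileInputFormat /Users/xxx/Downloads/flink-1.13.6` prints the matching jars, or nothing if no jar in that tree provides the class. An empty result would indicate that the class is not being supplied by any jar under the mounted directory.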


          People

            phongnt Phong Nguyen
            darenwkt Daren Wong
