Apache Sedona / SEDONA-222

GeoParquet reader does not work in non-local mode


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.3.0, 1.3.1
    • Fix Version/s: 1.4.0
    • None

    Description

      The GeoParquet reader works correctly when loading one of the example Parquet files in local mode; the geometry column is read with type geometry:

      scala> spark.read.format("geoparquet").load("/path/to/example1.parquet").printSchema
      root
       |-- pop_est: long (nullable = true)
       |-- continent: string (nullable = true)
       |-- name: string (nullable = true)
       |-- iso_a3: string (nullable = true)
       |-- gdp_md_est: double (nullable = true)
       |-- geometry: geometry (nullable = true)
      

      When running the same code in standalone cluster mode, the type of the geometry column is binary instead of geometry:

      scala> spark.read.format("geoparquet").load("/path/to/example1.parquet").printSchema
      root
       |-- pop_est: long (nullable = true)
       |-- continent: string (nullable = true)
       |-- name: string (nullable = true)
       |-- iso_a3: string (nullable = true)
       |-- gdp_md_est: double (nullable = true)
       |-- geometry: binary (nullable = true)
      
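      The difference between the two runs can also be checked programmatically rather than by reading the printSchema output. The following is a minimal sketch, not part of the original report: the object name GeoParquetSchemaCheck, the helper columnTypeName, and the command-line handling are hypothetical. It assumes Spark and Sedona are on the classpath and that the master is supplied by spark-submit. It relies only on the Spark schema API and compares the type name that printSchema displays.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

// Hypothetical helper object, for illustration only.
object GeoParquetSchemaCheck {

  // Returns the Spark type name of a column, e.g. "geometry" or "binary".
  // This is the same name that printSchema shows after the column name.
  def columnTypeName(df: DataFrame, column: String): String =
    df.schema(column).dataType.typeName

  def main(args: Array[String]): Unit = {
    // Assumption: the file path is passed as the first argument;
    // the default is the placeholder path used in the report.
    val path = args.headOption.getOrElse("/path/to/example1.parquet")

    // The master (local or standalone cluster) is expected to come from spark-submit.
    val spark = SparkSession.builder()
      .appName("geoparquet-schema-check")
      .getOrCreate()

    val df = spark.read.format("geoparquet").load(path)
    val actual = columnTypeName(df, "geometry")
    spark.stop()

    if (actual == "geometry") {
      println("geometry column was read as geometry")
    } else {
      // On the affected versions in cluster mode this reports "binary".
      System.err.println(s"geometry column was read as $actual")
      sys.exit(1)
    }
  }
}
```

      Running the same jar once with a local master and once against a standalone cluster should make the reported behavior visible as a differing exit code.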


            People

              Assignee: Unassigned
              Reporter: Kristin Cowalcijk (kontinuation)