Uploaded image for project: 'Apache Sedona'
  1. Apache Sedona
  2. SEDONA-669

GeoParquet format should handle timestamp_ntz columns properly

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.7.0

    Description

      A user reported this problem:

      In Sedona 1.6.1 (pyspark 3.5), I can write columns of type TimestampNTZType() in parquet format, but if I try writing as a geoparquet it crashes with:

      java.lang.RuntimeException: Unsupported data type TimestampNTZType.
      

      The GeoParquet code in sedona was based on the Parquet support code in Spark 3.3, which did not have TimestampNTZType. We can fix this problem by adding TimestampNTZ support to sedona-spark 3.4 and 3.5.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              kontinuation Kristin Cowalcijk
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 20m
                  20m