  1. Apache Sedona
  2. SEDONA-349

Drop the support of typed RDDs

Details

    • Type: Task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      Typed RDDs in Sedona, including PointRDD, PolygonRDD, RectangleRDD and LineStringRDD, were introduced in GeoSpark 0.1.

       

      I propose to remove these RDDs.

      1. These typed RDDs are not well designed. For example, they cannot handle GeometryCollection or Multi-XXX objects, and they do not support a dataset whose geometries have mixed types.

       

      2. This code has not been touched for a very long time, and most people use the Sedona SQL API together with Sedona's generic RDD, SpatialRDD.

       

      3. The existence of these RDDs makes code maintenance hard, especially because their tests involve numerous CRS transform API calls and we plan to switch the lat/lon order to lon/lat.

       

      4. We already have good replacements for these RDDs. Users can easily create a SpatialRDD using WKBReader, WKTReader, GeoJSONreader, or ShapefileReader.
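
A minimal, library-free sketch of the idea behind points 1 and 4: format-specific readers that all feed one generic container, so a single dataset can hold mixed geometry types. `GeometryRecord`, `GenericSpatialCollection`, `read_wkt` and `read_geojson` are hypothetical stand-ins written for illustration. They are not Sedona classes or methods, and they only inspect the geometry type tag rather than parsing coordinates.

```python
import json
import re
from dataclasses import dataclass, field


@dataclass(frozen=True)
class GeometryRecord:
    """Hypothetical stand-in for a parsed geometry: type tag plus raw source text."""
    geom_type: str
    source: str


@dataclass
class GenericSpatialCollection:
    """Hypothetical stand-in for a generic spatial RDD: accepts any geometry type."""
    records: list = field(default_factory=list)

    def geometry_types(self):
        # Distinct geometry types present in the collection, sorted by name.
        return sorted({r.geom_type for r in self.records})


def read_wkt(lines):
    """Build a generic collection from WKT strings using the leading type keyword."""
    out = GenericSpatialCollection()
    for line in lines:
        match = re.match(r"\s*([A-Za-z]+)", line)
        if match is None:
            raise ValueError(f"not a WKT string: {line!r}")
        out.records.append(GeometryRecord(match.group(1).upper(), line))
    return out


def read_geojson(lines):
    """Build a generic collection from GeoJSON geometry objects, one per line."""
    out = GenericSpatialCollection()
    for line in lines:
        geom_type = json.loads(line)["type"].upper()
        out.records.append(GeometryRecord(geom_type, line))
    return out


wkt = read_wkt([
    "POINT (1 2)",
    "MULTIPOLYGON (((0 0, 1 0, 1 1, 0 0)))",
    "GEOMETRYCOLLECTION (POINT (1 2), LINESTRING (0 0, 1 1))",
])
geojson = read_geojson([
    '{"type": "Point", "coordinates": [1, 2]}',
    '{"type": "MultiLineString", "coordinates": [[[0, 0], [1, 1]]]}',
])
print(wkt.geometry_types())
print(geojson.geometry_types())
```

Both readers return the same container type, which is the property that makes a per-geometry-type class hierarchy unnecessary in this sketch.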

       

      Because of the large volume of legacy code, I will start by removing the redundant constructors of these typed RDDs, in particular those that call analyze(), storageLevel() and crsTransform() internally.
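
The kind of redundancy described above can be sketched in plain Python. Everything here is hypothetical: `SpatialCollection`, `legacy_style_create` and `swap_axes` are illustrative names, not Sedona code, and persistence (the storageLevel() part) is left out because it has no plain-Python analogue. The sketch only shows that a convenience constructor which transforms and analyzes implicitly produces the same state as a minimal constructor followed by explicit calls.

```python
class SpatialCollection:
    """Hypothetical stand-in for a spatial RDD; not a Sedona class."""

    def __init__(self, points):
        # Minimal constructor: store the data and do nothing else.
        self.points = list(points)
        self.boundary = None

    def crs_transform(self, transform):
        """Apply a caller-supplied (x, y) -> (x, y) function to every point."""
        self.points = [transform(x, y) for x, y in self.points]
        return self

    def analyze(self):
        """Compute the bounding box as (min_x, min_y, max_x, max_y)."""
        xs = [x for x, _ in self.points]
        ys = [y for _, y in self.points]
        self.boundary = (min(xs), min(ys), max(xs), max(ys))
        return self


def legacy_style_create(points, transform):
    """Mimics a redundant constructor: transform and analyze happen implicitly."""
    return SpatialCollection(points).crs_transform(transform).analyze()


def swap_axes(x, y):
    # Illustrative transform only: flips the coordinate order.
    return (y, x)


raw = [(10.0, 50.0), (12.0, 48.0)]

# Implicit: one call hides two extra steps.
implicit = legacy_style_create(raw, swap_axes)

# Explicit: the same steps, visible at the call site.
explicit = SpatialCollection(raw)
explicit.crs_transform(swap_axes)
explicit.analyze()

print(implicit.boundary == explicit.boundary)
```

Because the explicit form reaches the same result, dropping the convenience path in this sketch loses no functionality and leaves a single place where each step is performed.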

            People

              jiayu Jia Yu
              Votes: 0
              Watchers: 1

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 40m