  1. Apache Sedona
  2. SEDONA-18

error reading shapefile

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.0.0
    • Fix Version/s: 1.2.0

    Description

      I get the following error when calling:

      ShapefileReader.readToGeometryRDD(spark.sparkContext, path)
      

       

      java.io.IOException: Can't find .shp file.
       at org.apache.sedona.core.formatMapper.shapefileParser.shapes.CombineShapeReader.initialize(CombineShapeReader.java:107)
       at org.apache.spark.rdd.NewHadoopRDD$$anon$1.liftedTree1$1(NewHadoopRDD.scala:216)
       at org.apache.spark.rdd.NewHadoopRDD$$anon$1.<init>(NewHadoopRDD.scala:213)
       at org.apache.spark.rdd.NewHadoopRDD.compute(NewHadoopRDD.scala:168)
       at org.apache.spark.rdd.NewHadoopRDD.compute(NewHadoopRDD.scala:71)
       at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:349)
       at org.apache.spark.rdd.RDD.iterator(RDD.scala:313)
       at org.apache.spark.rdd.MapPartitionsRDD.compute(MapPartitionsRDD.scala:52)
       at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:349)
       at org.apache.spark.rdd.RDD.iterator(RDD.scala:313)
       at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:90)
       at org.apache.spark.scheduler.Task.run(Task.scala:127)
       at org.apache.spark.executor.Executor$TaskRunner.$anonfun$run$3(Executor.scala:446)
       at org.apache.spark.util.Utils$.tryWithSafeFinally(Utils.scala:1377)
       at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:449)
       at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
       at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
       at java.lang.Thread.run(Thread.java:748)

      The path does contain a .shp file, but it also contains a few XML files whose names contain .shp. If I rename these files, I can load the shapefile.

      Example shapefile: tl_2020_us_zcta510.zip. If I rename the following files so that their names no longer contain .shp, or delete them, everything works as expected:

      tl_2020_us_zcta510.shp.ea.iso.xml
      tl_2020_us_zcta510.shp.iso.xml
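
      The manual workaround above can be automated. The following is a minimal sketch, not part of Sedona: the class ShapefileSidecarCleaner and its methods are hypothetical names, and it assumes the shapefile directory is on the local filesystem (HDFS or S3 would need a different file API). It moves every file whose name contains ".shp" without ending in ".shp" into a separate holding directory, which should be outside the shapefile directory. The name test only mirrors the two files listed above; it makes no claim about how Sedona matches files internally.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Apache Sedona.
public class ShapefileSidecarCleaner {

    // True for names such as "tl_2020_us_zcta510.shp.iso.xml": ".shp" appears
    // in the name, but the name does not end with ".shp".
    public static boolean isConflictingSidecar(String fileName) {
        String lower = fileName.toLowerCase(Locale.ROOT);
        return lower.contains(".shp") && !lower.endsWith(".shp");
    }

    // Moves conflicting files out of shapefileDir into holdingDir and
    // returns the names of the moved files in sorted order.
    public static List<String> moveConflictingSidecars(Path shapefileDir, Path holdingDir)
            throws IOException {
        Files.createDirectories(holdingDir);

        // Collect first, then move, so the directory is not modified while it is listed.
        List<Path> toMove = new ArrayList<>();
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(shapefileDir)) {
            for (Path entry : entries) {
                String name = entry.getFileName().toString();
                if (Files.isRegularFile(entry) && isConflictingSidecar(name)) {
                    toMove.add(entry);
                }
            }
        }

        List<String> moved = new ArrayList<>();
        for (Path entry : toMove) {
            Files.move(entry, holdingDir.resolve(entry.getFileName()));
            moved.add(entry.getFileName().toString());
        }
        Collections.sort(moved);
        return moved;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("usage: ShapefileSidecarCleaner <shapefileDir> <holdingDir>");
            return;
        }
        List<String> moved = moveConflictingSidecars(Paths.get(args[0]), Paths.get(args[1]));
        System.out.println("Moved: " + moved);
    }
}
```

      After running it against the extracted tl_2020_us_zcta510 directory, the same directory can be passed to ShapefileReader.readToGeometryRDD as before. Moving the files rather than deleting them keeps the metadata available.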

       

            People

              Assignee: Unassigned
              Reporter: Brian Lockwood (lockwobr)
                Time Tracking

                  Original Estimate: Not Specified
                  Remaining Estimate: 0h
                  Time Spent: 20m