Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-17525

SparkContext.clearFiles() still present in the PySpark bindings though the underlying Scala method was removed in Spark 2.0

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Trivial
    • Resolution: Fixed
    • 2.0.0
    • 2.0.1, 2.1.0
    • PySpark
    • None

    Description

      STR: In PySpark shell, run sc.clearFiles()

      What happens:

      py4j.protocol.Py4JError: An error occurred while calling o74.clearFiles. Trace:
      py4j.Py4JException: Method clearFiles([]) does not exist
      	at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:318)
      	at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:326)
      	at py4j.Gateway.invoke(Gateway.java:272)
      	at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:128)
      	at py4j.commands.CallCommand.execute(CallCommand.java:79)
      	at py4j.GatewayConnection.run(GatewayConnection.java:211)
      	at java.lang.Thread.run(Thread.java:745)
      

      Apparently the old and deprecated SparkContext.clearFiles() was removed from Spark 2.0 but it's still present in the PySpark API. It should be removed from there too.

      Attachments

        Activity

          People

            sjakthol Sami Jaktholm
            sjakthol Sami Jaktholm
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: