Description
STR: In PySpark shell, run sc.clearFiles()
What happens:
py4j.protocol.Py4JError: An error occurred while calling o74.clearFiles. Trace:
py4j.Py4JException: Method clearFiles([]) does not exist
    at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:318)
    at py4j.reflection.ReflectionEngine.getMethod(ReflectionEngine.java:326)
    at py4j.Gateway.invoke(Gateway.java:272)
    at py4j.commands.AbstractCommand.invokeMethod(AbstractCommand.java:128)
    at py4j.commands.CallCommand.execute(CallCommand.java:79)
    at py4j.GatewayConnection.run(GatewayConnection.java:211)
    at java.lang.Thread.run(Thread.java:745)
Apparently the old, deprecated SparkContext.clearFiles() was removed from the JVM side in Spark 2.0, but the method is still present in the PySpark API, where it forwards the call to a Java method that no longer exists. It should be removed from PySpark too.
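For illustration, the failure mode can be sketched without a Spark installation. The class names below (FakeJavaContext, FakePySparkContext) and the helper call_fails are hypothetical stand-ins, not PySpark code, and the wrapper body is an assumption about how the Python method delegates. In real PySpark the call goes through Py4J and surfaces as the Py4JError shown above; in this plain-Python sketch the missing method surfaces as an AttributeError instead.

```python
class FakeJavaContext:
    """Hypothetical stand-in for the JVM-side context after clearFiles() was removed."""

    def addFile(self, path):
        # Some unrelated method that still exists on the backend.
        return None


class FakePySparkContext:
    """Hypothetical stand-in for a Python wrapper that still exposes clearFiles()."""

    def __init__(self, jsc):
        self._jsc = jsc

    def clearFiles(self):
        # Assumed delegation: forwards to a backend method that no longer exists.
        self._jsc.clearFiles()


def call_fails(ctx):
    """Return True if ctx.clearFiles() fails because the backend lacks the method."""
    try:
        ctx.clearFiles()
    except AttributeError:
        return True
    return False


sc = FakePySparkContext(FakeJavaContext())
print(call_fails(sc))
```

The wrapper method is visible to Python callers, so the problem only shows up when the method is actually called, which is why removing it from the Python API is the cleaner fix.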