Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Not A Problem
-
2.0.1
-
None
-
None
Description
If the paths Seq parameter contains a lot of elements, then DataFrameReader.load takes a lot of time starting the job as it attempts to check if each of the path exists using fs.exists. There should be a boolean configuration option to disable the checking for path's existence and that should be passed in as parameter to DataSource.resolveRelation call.