Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Incomplete
-
2.2.0, 2.2.1
-
None
Description
When using file-based data from custom formats, Spark's ability to use Hadoop's FileInputFormats is very handy. However, when the path they are pointed at contains no usable data, they throw an IOException saying "No input paths specified in job".
It would be a nice feature if the DataFrame API somehow could capture this and return an empty DataFrame instead of failing the job.