Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Won't Fix
Description
Right now it's hard to determine which input files, or which blocks within them, contain invalid data. The InputSplit for a HadoopRDD is not even exposed programmatically in Scala/Java (it's private[spark]).