Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Duplicate
-
2.4.0
-
None
-
None
Description
HadoopRdd filter empty files to avoid generating empty tasks that affect the performance of the Spark computing performance.
Empty file's length is zero.