Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Invalid
-
2.3.0
-
None
-
None
Description
We have developed a custom file input format and calling it in pyspark using newAPIHadoopFile option. It appears there is no option to pass parameters dynamically to the custom format.
rdd2 = sc.newAPIHadoopFile("/abcd/efgh/i1.txt", "com.test1.TEST2.TESTInputFormat", "org.apache.hadoop.io.Text", "org.apache.hadoop.io.NullWritable")