Such a StringInputFormat would be like TextInputFormat but with input types of Long and String, rather than LongWritable and Text. This would allow users to write MapReduce programs that used only Java native types (i.e. no Writables).
Such a StringInputFormat would be like TextInputFormat but with input types of Long and String, rather than LongWritable and Text. This would allow users to write MapReduce programs that used only Java native types (i.e. no Writables).
This is currently not possible to write without changes to Hadoop due to a limitation in the RecordReader interface explained here: https://issues.apache.org/jira/browse/HADOOP-3413?focusedCommentId=12597935#action_12597935