Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Incomplete
-
1.3.0, 1.4.0, 1.5.0
-
None
Description
There is requirement to save "user-defined" Metadata as part of Sequence File "Header" using Spark.
To write a User-defined metadata as part of Sequence File using regular Hadoop API's I pass the metadata object to SequenceFile.Writer constructor which when creates a SequenceFile ensures the metadata is part of the sequence file header.
Currently Spark's JavaPairRDD Api provides methods to save an RDD to an SequenceFile format, but I don't see any API which can either give the SequenceFile.writer or a method where in I can pass the user-defined metadata so as to be written as part of sequence file header.
The enhancement request an API to implement the same