Details
-
New Feature
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
Description
We currently provide InputFormats for reading from Accumulo and output formats for both direct input as well as outputting RFiles. But we provide no mechanism for doing a mapreduce over existing RFiles, which may be useful for optimizing data flow. We already have input formats which use RFiles directly for input (The offline input format Keith just finished), but that still relies on the Accumulo structure. We should go ahead and also create an input format that just hits RFiles like the other standard file input formats.
Attachments
Issue Links
- duplicates
-
ACCUMULO-418 Make RFiles splittable
- Resolved