Details
Description
Background
RowFile is the row-oriented binary file format of Tajo. Currently, RowFile does not support compression.
Proposal
The goal of this proposal is to implement block-based compression of RowFile. Requirements are follows.
- The compression type should be configurable via TableMeta.
- The block size also should be configurable via Configuration.
- LZO, BZip2, GZip should be supported. (I recommend utilizing Hadoop's CompressionInputStream/CompressionOutputStream)
Attachments
Issue Links
- is related to
-
TAJO-134 Support for compression/decompression of CSVFile
- Resolved