Details
-
Sub-task
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Parquet should support
ALTER TABLE table_name [PARTITION (partition_key = 'partition_value')] CONCATENATE;
If the table or partition contains many small Parquet files, then the above command will merge them into larger files. The merge should happen at row group level thereby avoiding the overhead of decompressing and decoding the data.
It is only supported by RCFiles or ORCFiles now.
Attachments
Attachments
Issue Links
- is related to
-
PARQUET-382 Add a way to append encoded blocks in ParquetFileWriter
- Resolved