Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
Parquet ColumnarRow split reader is introduced to read parquet data in batches.
When returning each row of data, instead of actually retrieving each field, we use BaseRow's abstraction to return a Columnar Row-like view.
This will greatly improve the downstream filtered scenarios, so that there is no need to access redundant fields on the filtered data.
Attachments
Issue Links
- duplicates
-
FLINK-15164 Introduce ParquetColumnarRowSplitReader to parquet format
-
- Closed
-
- is a child of
-
FLINK-11492 Support the full Blink SQL runtime
-
- Closed
-
- is related to
-
FLINK-16032 Depends on core classifier hive-exec in hive connector
-
- Open
-
- relates to
-
FLINK-16359 Introduce WritableVectors for abstract writing
-
- Resolved
-
- links to