Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
-
None
Description
In ParquetRecordReaderWrapper constructor, it create splits, set projections and filters in conf, create task context, and then create Parquet record reader. In this procedure, we could improve the logic of conf usage:
1. the updated jobConf is not passed to Parquet in one case. (https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/read/ParquetRecordReaderWrapper.java#L103)
I think we should use the instance variable jobConf, which is updated when projectioning and filtering, instead of passed in oldJobConf.