Step 1 of 4: Choose Issues

Cancel

T Patch Info Key Summary Assignee Reporter P Status Resolution Created Updated Due Development
Sub-task SPARK-38891

SPARK-35743 Skipping allocating vector for repetition & definition levels when possible

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-38840

SPARK-35743 Enable spark.sql.parquet.enableNestedColumnVectorizedReader on master branch by default

Chao Sun Chao Sun Minor Resolved Fixed  
Sub-task SPARK-38179

SPARK-35743 Improve WritableColumnVector to better support null struct

Unassigned Chao Sun Minor Resolved Won't Fix  
Sub-task SPARK-37864

SPARK-35743 Support Parquet v2 data page RLE encoding (for Boolean Values) for the vectorized path

Yang Jie Yang Jie Major Resolved Fixed  
Sub-task SPARK-36935

SPARK-35743 Enhance ParquetSchemaConverter to capture Parquet repetition & definition level

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-36891

SPARK-35743 Refactor SpecificParquetRecordReaderBase and add more coverage on vectorized Parquet decoding

Chao Sun Chao Sun Major Resolved Fixed  
Sub-task SPARK-36879

SPARK-35743 Support Parquet v2 data page encodings for the vectorized path

Parth Chandra Chao Sun Major Resolved Fixed  
Sub-task SPARK-36529

SPARK-35743 Decouple CPU with IO work in vectorized Parquet reader

Unassigned Chao Sun Major Open Unresolved  
Sub-task SPARK-36528

SPARK-35743 Implement lazy decoding for the vectorized Parquet reader

Unassigned Chao Sun Major Open Unresolved  
Sub-task SPARK-36527

SPARK-35743 Implement lazy materialization for the vectorized Parquet reader

Unassigned Chao Sun Major Open Unresolved  
Sub-task SPARK-36511

SPARK-35743 Remove ColumnIO once PARQUET-2050 is released in Parquet 1.13

BingKun Pan Chao Sun Minor Resolved Fixed  
Sub-task SPARK-36131

SPARK-35743 Refactor ParquetColumnIndexSuite

Chao Sun Chao Sun Minor Resolved Fixed  
Sub-task SPARK-36123

SPARK-35743 Parquet vectorized reader doesn't skip null values correctly

Chao Sun Chao Sun Blocker Resolved Fixed  
Sub-task SPARK-36056

SPARK-35743 Combine readBatch and readIntegers in VectorizedRleValuesReader

Chao Sun Chao Sun Minor Resolved Fixed  
Sub-task SPARK-35867

SPARK-35743 Enable vectorized read for VectorizedPlainValuesReader.readBooleans

Kazuyuki Tanimura Chao Sun Minor Resolved Fixed  
Sub-task SPARK-35846

SPARK-35743 Introduce ParquetReadState to track various states while reading a Parquet column chunk

Chao Sun Chao Sun Minor Resolved Fixed  
Sub-task SPARK-35640

SPARK-35743 Refactor Parquet vectorized reader to remove duplicated code paths

Chao Sun Chao Sun Major Resolved Fixed  

Cancel