[PARQUET-2135] Performance optimizations: Merged all LittleEndianDataInputStream functionality into ByteBufferInputStream - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.12.2
Fix Version/s: None
Component/s: parquet-mr
Labels:
None

External issue URL:
https://github.com/apache/parquet-mr/pull/953
Language:
- Java

Description

This PR is all performance optimization. In benchmarking with Trino, we find query performance to improve from 5% to 15%, depending on the query, and that includes all the I/O time from S3.

The main modification is to merge all of LittleEndianDataInputStream functionality into ByteBufferInputStream, which yields the following benefits:

Elimination of extra layers of abstraction and method call overhead
Enable the use of intrinsics for readInt, readLong, etc.
Availability of faster access methods like readFully and skipFully, without the need for helper functions
Reduces some object creation in the performance critical path

This also includes and enables performance optimizations to:

ByteBitPackingValuesReader
PlainValuesReader
RunLengthBitPackingHybridDecoder

Context:
I've been working on improving Parquet reading performance in Trino, mostly by profiling while running performance benchmarks and TPCDS queries. This PR is a subset of the changes I made that have more than doubled the performance of a lot of TPCDS queries (wall clock time, including the S3 access time). If you are kind enough to accept these changes, I have more I would like to contribute.

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Timothy Miller

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 01/Apr/22 17:08

Updated:: 05/Apr/22 20:02