[PARQUET-1830] Vectorized API to support Column Index in Apache Spark - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 1.11.0
Fix Version/s: None
Component/s: parquet-mr
Labels:
None

Description

As per the comment on https://issues.apache.org/jira/browse/SPARK-26345. Its seems like Apache Spark doesn't support Column Index until we disable vectorizedReader in Spark - which will have other performance implications. As per zi , parquet-mr should implement a Vectorized API. Is it already implemented or any pull request for the same?

Attachments

Activity

People

Assignee:: Unassigned

Reporter:: Felix Kizhakkel Jose

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 26/Mar/20 19:27

Updated:: 23/Jun/24 03:31