Parquet / PARQUET-1830

Vectorized API to support Column Index in Apache Spark

Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.11.0
    • Fix Version/s: None
    • Component/s: parquet-mr
    • Labels: None

    Description

      As per the comment on https://issues.apache.org/jira/browse/SPARK-26345, it seems that Apache Spark does not support Column Index unless we disable vectorizedReader in Spark, which has other performance implications. As per zi, parquet-mr should implement a vectorized API. Is it already implemented, or is there a pull request for it?
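
      For reference, the workaround mentioned above (turning off the vectorized reader) can be sketched roughly as follows. This is a minimal sketch, not a recommended setup: the helper names `non_vectorized_parquet_conf` and `apply_conf` are hypothetical, and it assumes the standard Spark SQL settings `spark.sql.parquet.enableVectorizedReader` and `spark.sql.parquet.filterPushdown` are what control this behaviour in the Spark version in use.

```python
def non_vectorized_parquet_conf():
    """Settings that route Parquet reads through the non-vectorized path.

    Assumption: with the vectorized reader off, Spark falls back to the
    row-based parquet-mr reader, which is where column index filtering
    would apply to pushed-down predicates.
    """
    return {
        # Disable Spark's vectorized Parquet reader (the workaround).
        "spark.sql.parquet.enableVectorizedReader": "false",
        # Keep predicate pushdown on so filters reach the Parquet reader.
        "spark.sql.parquet.filterPushdown": "true",
    }


def apply_conf(builder, conf):
    """Apply each key/value pair to a builder exposing .config(key, value),
    such as pyspark's SparkSession.builder, and return the builder."""
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder


# Hypothetical usage (requires pyspark, not imported here):
#   from pyspark.sql import SparkSession
#   spark = apply_conf(SparkSession.builder,
#                      non_vectorized_parquet_conf()).getOrCreate()
```

      Disabling the vectorized reader applies to every Parquet scan in the session, which is the performance trade-off this issue is asking to avoid.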

          People

            Assignee: Unassigned
            Reporter: Felix Kizhakkel Jose (FelixKJose)
            Votes: 0
            Watchers: 6