Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-4383

Enable block size support in ParquetIO

Details

    • Improvement
    • Status: Open
    • P3
    • Resolution: Unresolved
    • None
    • None
    • io-ideas, io-java-parquet
    • None

    Description

      Parquet API allows block size support, which can improve IO performance when working with Parquet files. Currently, the ParquetIO does not support it at all so it looks like a room for improvement for this IO.

      Good intro into this topic: https://www.dremio.com/tuning-parquet/ 

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              ŁukaszG Lukasz Gajowy
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: