Beam / BEAM-4383

Enable block size support in ParquetIO

    Details

    • Type: Improvement
    • Status: Open
    • Priority: P3
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: io-ideas, io-java-parquet
    • Labels:
      None

      Description

      The Parquet API allows the block (row group) size to be configured, which can improve IO performance when working with Parquet files. ParquetIO currently does not expose this setting at all, so this looks like room for improvement in this IO.

      A good introduction to this topic: https://www.dremio.com/tuning-parquet/
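
      To make the effect of the setting concrete, here is a rough sketch of how the block size relates to the number of row groups in a file. In parquet-mr the knob is exposed as `ParquetWriter.Builder#withRowGroupSize` and as the `parquet.block.size` Hadoop property, with a default of 128 MiB. The class and method names below are hypothetical, and the estimate assumes every row group fills up to the configured size, which real writers only approximate.

```java
// Illustrative sketch only: RowGroupEstimate and its methods are hypothetical
// names and are not part of Beam or parquet-mr.
final class RowGroupEstimate {

    // parquet-mr's default row group ("block") size: 128 MiB.
    static final long DEFAULT_BLOCK_SIZE = 128L * 1024 * 1024;

    // Approximate number of row groups for dataBytes of buffered data,
    // assuming each row group is filled up to blockSize bytes.
    static long estimateRowGroups(long dataBytes, long blockSize) {
        if (dataBytes < 0 || blockSize <= 0) {
            throw new IllegalArgumentException("dataBytes must be >= 0 and blockSize > 0");
        }
        // Ceiling division written to avoid overflow on large inputs.
        return dataBytes / blockSize + (dataBytes % blockSize == 0 ? 0 : 1);
    }

    public static void main(String[] args) {
        long oneGiB = 1024L * 1024 * 1024;
        // Compare the default block size with a larger one for the same data.
        System.out.println(estimateRowGroups(oneGiB, DEFAULT_BLOCK_SIZE));
        System.out.println(estimateRowGroups(oneGiB, 512L * 1024 * 1024));
    }
}
```

      A larger block size yields fewer, bigger row groups for the same data, which is the trade-off a ParquetIO option for this setting would let pipeline authors tune.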

              People

              • Assignee: Unassigned
              • Reporter: Lukasz Gajowy
              • Votes: 1
              • Watchers: 4
