  1. Apache Hop (Retired)
  2. HOP-1963

Can we read and write Parquet data, please?

Details

    Description

      When working with S3, or even HDFS, Parquet is pretty much the de facto binary format. It'd be great to be able to read and write it.

      At the same time, it would be nice to be able to specify a size limit for Parquet files (e.g. 128 MB) so that we don't create one massive file.

      If this could work via VFS, all the better, so that you can read and write Parquet directly from S3 etc.

          People

            Unassigned
            mxm Maximilian Michels
            Votes:
            1
            Watchers:
            3

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Remaining:
                Remaining Estimate - 0h
                Logged:
                Time Spent - 40m