Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-2882

[C++][Python] Support AWS Firehose partition_scheme implementation for Parquet datasets

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: 2.0.0
    • Component/s: C++, Python
    • Labels:

      Description

      I'd like to be able to read a ParquetDataset generated by AWS Firehose.

      The only implementation at the time of writting was the partition scheme created by hive (year=2018/month=01/day=11).

      AWS Firehose partition scheme is a little bit different (2018/01/11).

       

      Thanks

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              IceS2 Pablo Javier Takara
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated: