Uploaded image for project: 'Beam'
  1. Beam
  2. BEAM-14236

[Python] Write to Parquet support for list to conform with Apache Parquet specification

Details

    • Bug
    • Status: Resolved
    • P2
    • Resolution: Fixed
    • None
    • Not applicable
    • io-py-parquet
    • None
    • Patch

    Description

      ARROW-11497 The pyarrow parquet writer now support the list type contains 3 level where the middle level, named list, must be a repeated group with a single field named element. https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#lists

      I think we can simply populate it to WriteToParquet by adding additional flag (use_compliant_nested_type) to 
       conform with Apache Parquet specification.

      Attachments

        Issue Links

          Activity

            People

              Shiv22Wabale Shivraj Devidas Wabale
              Shiv22Wabale Shivraj Devidas Wabale
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 3.5h
                  3.5h