Uploaded image for project: 'Parquet'
  1. Parquet
  2. PARQUET-221

For array type, inconsistent names are used as the array element name.

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.6.0
    • None
    • parquet-mr
    • None

    Description

      When creating a convert for an array, Parquet Avro uses "array" as the field name name (see here) , but Parquet Hive SerDe uses "array_element" as the field name see here. In Spark SQL, our native Parquet support is following Parquet Avro's convention, for data generated by Parquet Hive SerDe, the array value cannot be correctly read and null will be returned.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              yhuai Yin Huai
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated: