Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-41096

Support reading parquet FIXED_LEN_BYTE_ARRAY type

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.4.0
    • 3.4.0
    • SQL
    • None

    Description

      Parquet has FIXED_LEN_BYTE_ARRAY (FLBA) data type. However, Spark Parquet reader currently cannot handle it.
      Read it as BinaryType in Spark.

      Iceberg Parquet reader, for example, can handle FLBA. This improvement should reduce the gap between Spark and Iceberg Parquet reader.

      Attachments

        Issue Links

          Activity

            People

              kazuyukitanimura Kazuyuki Tanimura
              kazuyukitanimura Kazuyuki Tanimura
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: