Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-6774 Implement Parquet complex types backwards-compatiblity rules
  3. SPARK-6775

Simplify CatalystConverter class hierarchy and pass in Parquet schema

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.0.2, 1.1.1, 1.2.1, 1.3.0
    • None
    • SQL
    • None

    Description

      CataystConverter classes are used to convert Parquet records to Spark SQL row objects. Current converter implementations have the following problems:

      1. They simply ignore original Parquet schema, which makes adding Parquet backwards-compatibility rules impossible.
      2. They are unnecessary over complicated.
      3. SpecificMutableRow is only used for structs whose fields are all of primitive types.

      Attachments

        Activity

          People

            lian cheng Cheng Lian
            lian cheng Cheng Lian
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: