Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-6561

Add partition support in saveAsParquet

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • None
    • 1.4.0
    • SQL
    • None

    Description

      Now ParquetRelation2 supports automatic partition discovery which is very nice.

      When we save a DataFrame into Parquet files, we also want to have it partitioned.

      The proposed API looks like this:

      def saveAsParquetFile(path: String, partitionColumns: Seq[String])
      

      Jianshi

      Attachments

        Issue Links

          Activity

            People

              lian cheng Cheng Lian
              huangjs Jianshi Huang
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: