SPARK-21791

ORC should support column names with dot


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: 2.0.0, 2.1.0, 2.2.0
    • Fix Version/s: 2.3.0
    • Component/s: SQL
    • Labels: None

    Description

      PARQUET

      scala> Seq(Some(1), None).toDF("col.dots").write.parquet("/tmp/parquet_dot")
      scala> spark.read.parquet("/tmp/parquet_dot").show
      +--------+
      |col.dots|
      +--------+
      |       1|
      |    null|
      +--------+
      

      ORC

      scala> Seq(Some(1), None).toDF("col.dots").write.orc("/tmp/orc_dot")
      scala> spark.read.orc("/tmp/orc_dot").show
      org.apache.spark.sql.catalyst.parser.ParseException:
      mismatched input '.' expecting ':'(line 1, pos 10)
      
      == SQL ==
      struct<col.dots:int>
      ----------^^^
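
The `== SQL ==` section of the trace suggests that the failure comes from parsing the schema string `struct<col.dots:int>` as a Catalyst data type, not from the ORC data itself. The following spark-shell sketch feeds that same string to `CatalystSqlParser.parseDataType` directly and compares it with a backtick-quoted variant. The quoted form is an assumption made here for illustration and is not taken from the issue; whether the ORC reader goes through exactly this code path is likewise inferred from the trace only.

```scala
import org.apache.spark.sql.catalyst.parser.CatalystSqlParser
import org.apache.spark.sql.types.StructType
import scala.util.Try

// Schema string exactly as it appears in the error message above.
val unquoted = "struct<col.dots:int>"
// Same schema with the field name wrapped in backticks (illustrative assumption).
val quoted = "struct<`col.dots`:int>"

// Try captures the ParseException instead of aborting the shell session.
val unquotedResult = Try(CatalystSqlParser.parseDataType(unquoted))
val quotedResult = Try(CatalystSqlParser.parseDataType(quoted))

println(s"unquoted parses: ${unquotedResult.isSuccess}")
println(s"quoted parses:   ${quotedResult.isSuccess}")

// When the quoted form parses, the dot is kept as part of the field name.
quotedResult.foreach {
  case s: StructType => println(s.fieldNames.mkString(", "))
  case other         => println(s"unexpected type: $other")
}
```

If the unquoted string fails and the quoted one succeeds, that would point to the field name needing to be escaped before the schema string reaches the parser.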
      


          People

            Assignee: Unassigned
            Reporter: Dongjoon Hyun (dongjoon)