[SPARK-21144] Unexpected results when the data schema and partition schema have the duplicate columns - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.2.0
Fix Version/s: 2.2.0
Component/s: SQL
Labels:
None

Target Version/s:

2.2.0

Description

    withTempPath { dir =>
      val basePath = dir.getCanonicalPath
      spark.range(0, 3).toDF("foo").write.parquet(new Path(basePath, "foo=1").toString)
      spark.range(0, 3).toDF("foo").write.parquet(new Path(basePath, "foo=a").toString)
      spark.read.parquet(basePath).show()
    }

The result of the above case is

+---+
|foo|
+---+
|  1|
|  1|
|  a|
|  a|
|  1|
|  a|
+---+

Attachments

Issue Links

links to

[Github] Pull Request #17758 (maropu)

[Github] Pull Request #18356 (maropu)

[Github] Pull Request #18375 (maropu)

Activity

People

Assignee:: Takeshi Yamamuro

Reporter:: Xiao Li

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 19/Jun/17 21:39

Updated:: 23/Jun/17 16:31

Resolved:: 23/Jun/17 16:31