Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Cannot Reproduce
-
None
-
None
-
None
-
None
Description
When union 2 query with columns having array of bytes datatype, spark query fail with exception.
Example :
select binaryColumn from tableA
union
select binaryColumn from tableB
Note that spark properties "spark.sql.parquet.binaryAsString" is set to true
org.apache.spark.sql.AnalysisException: unresolved operator 'Union;
at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$class.failAnalysis(CheckAnalysis.scala:38) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.analysis.Analyzer.failAnalysis(Analyzer.scala:44) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1.apply(CheckAnalysis.scala:203) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1.apply(CheckAnalysis.scala:50) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.trees.TreeNode.foreachUp(TreeNode.scala:105) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.trees.TreeNode$$anonfun$foreachUp$1.apply(TreeNode.scala:104) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.trees.TreeNode$$anonfun$foreachUp$1.apply(TreeNode.scala:104) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at scala.collection.immutable.List.foreach(List.scala:381) ~[org.scala-lang.scala-library-2.11.8.jar:na]
at org.apache.spark.sql.catalyst.trees.TreeNode.foreachUp(TreeNode.scala:104) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$class.checkAnalysis(CheckAnalysis.scala:50) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.catalyst.analysis.Analyzer.checkAnalysis(Analyzer.scala:44) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.execution.QueryExecution.assertAnalyzed(QueryExecution.scala:34) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.DataFrame.<init>(DataFrame.scala:133) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.DataFrame$.apply(DataFrame.scala:52) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]
at org.apache.spark.sql.SQLContext.sql(SQLContext.scala:817) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]