Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-16704

Union does not work for column with array byte

Log workAgile BoardRank to TopRank to BottomAttach filesAttach ScreenshotBulk Copy AttachmentsBulk Move AttachmentsVotersWatch issueWatchersCreate sub-taskConvert to sub-taskMoveLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete CommentsDelete
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Cannot Reproduce
    • None
    • None
    • None
    • None

    Description

      When union 2 query with columns having array of bytes datatype, spark query fail with exception.

      Example :
      select binaryColumn from tableA
      union
      select binaryColumn from tableB

      Note that spark properties "spark.sql.parquet.binaryAsString" is set to true

      org.apache.spark.sql.AnalysisException: unresolved operator 'Union;
      at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$class.failAnalysis(CheckAnalysis.scala:38) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.analysis.Analyzer.failAnalysis(Analyzer.scala:44) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1.apply(CheckAnalysis.scala:203) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$$anonfun$checkAnalysis$1.apply(CheckAnalysis.scala:50) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.trees.TreeNode.foreachUp(TreeNode.scala:105) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.trees.TreeNode$$anonfun$foreachUp$1.apply(TreeNode.scala:104) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.trees.TreeNode$$anonfun$foreachUp$1.apply(TreeNode.scala:104) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at scala.collection.immutable.List.foreach(List.scala:381) ~[org.scala-lang.scala-library-2.11.8.jar:na]
      at org.apache.spark.sql.catalyst.trees.TreeNode.foreachUp(TreeNode.scala:104) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.analysis.CheckAnalysis$class.checkAnalysis(CheckAnalysis.scala:50) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.catalyst.analysis.Analyzer.checkAnalysis(Analyzer.scala:44) ~[iop-spark-client.spark-catalyst_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.execution.QueryExecution.assertAnalyzed(QueryExecution.scala:34) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.DataFrame.<init>(DataFrame.scala:133) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.DataFrame$.apply(DataFrame.scala:52) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]
      at org.apache.spark.sql.SQLContext.sql(SQLContext.scala:817) ~[iop-spark-client.spark-sql_2.11-1.6.0.jar:1.6.0]

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned Assign to me
            jiunnjye Ng Jiunn Jye
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment