Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-19459

ORC tables cannot be read when they contain char/varchar columns

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.2, 2.1.0
    • Fix Version/s: 2.1.1, 2.2.0
    • Component/s: SQL
    • Labels:
      None

      Description

      Reading from an ORC table which contains char/varchar columns can fail if the table has been created using Spark. This is caused by the fact that spark internally replaces char and varchar columns with a string column, this causes the ORC reader to use the wrong reader, and that eventually causes a ClassCastException.

        Issue Links

          Activity

          Hide
          apachespark Apache Spark added a comment -

          User 'hvanhovell' has created a pull request for this issue:
          https://github.com/apache/spark/pull/16804

          Show
          apachespark Apache Spark added a comment - User 'hvanhovell' has created a pull request for this issue: https://github.com/apache/spark/pull/16804
          Hide
          apachespark Apache Spark added a comment -

          User 'hvanhovell' has created a pull request for this issue:
          https://github.com/apache/spark/pull/17030

          Show
          apachespark Apache Spark added a comment - User 'hvanhovell' has created a pull request for this issue: https://github.com/apache/spark/pull/17030
          Hide
          apachespark Apache Spark added a comment -

          User 'hvanhovell' has created a pull request for this issue:
          https://github.com/apache/spark/pull/17041

          Show
          apachespark Apache Spark added a comment - User 'hvanhovell' has created a pull request for this issue: https://github.com/apache/spark/pull/17041
          Hide
          apachespark Apache Spark added a comment -

          User 'dongjoon-hyun' has created a pull request for this issue:
          https://github.com/apache/spark/pull/19235

          Show
          apachespark Apache Spark added a comment - User 'dongjoon-hyun' has created a pull request for this issue: https://github.com/apache/spark/pull/19235

            People

            • Assignee:
              hvanhovell Herman van Hovell
              Reporter:
              hvanhovell Herman van Hovell
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development