Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-43357

Spark AWS Glue date partition push down broken

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.1.0, 3.1.1, 3.1.2, 3.2.0, 3.1.3, 3.2.1, 3.3.0, 3.2.2, 3.3.1, 3.2.3, 3.2.4, 3.3.2
    • 3.5.0
    • SQL
    • None

    Description

      When using the following project: https://github.com/awslabs/aws-glue-data-catalog-client-for-apache-hive-metastore
      To have glue supported as as a hive metastore for spark there is an issue when reading a date-partitioned data set. Writing is fine.
      You get the following error: 

      org.apache.hadoop.hive.metastore.api.InvalidObjectException: Unsupported expression '2023 - 05 - 03' (Service: AWSGlue; Status Code: 400; Error Code: InvalidInputException; Request ID: beed68c6-b228-442e-8783-52c25b9d2243; Proxy: null)

       

      A fix for this is making sure the date passed to glue is quoted

      Attachments

        Activity

          People

            sdehaes Stijn De Haes
            sdehaes Stijn De Haes
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: