SPARK-48555

Support Column type for several SQL functions in Scala and Python


Details

    Description

      Currently, several SQL functions accept both native types and Columns in SQL, but accept only native types in their Scala/Python APIs:

      • array_remove (works in SQL and Scala, not in Python)
      • array_position (works in SQL and Scala, not in Python)
      • map_contains_key (works in SQL and Scala, not in Python)
      • substring (works only in SQL)

      For example, this is possible in SQL:

      spark.sql("select array_remove(col1, col2) from values(array(1,2,3), 2)")
      

      But not in Python:

      df.select(F.array_remove(F.col("col1"), F.col("col2")))
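
Until the Python API accepts Columns here, one possible workaround is to route the call through the SQL parser with F.expr, since the SQL form already works. The following is only a sketch: sql_call and select_with_column_args are hypothetical helper names, not part of PySpark, and the arguments are assumed to be valid SQL expression fragments such as plain column names.

```python
def sql_call(func_name, *sql_args):
    """Build a SQL expression string such as 'array_remove(col1, col2)'.

    Hypothetical helper: arguments are inserted verbatim, so each one must
    already be a valid SQL expression fragment (a column name or a literal).
    """
    return f"{func_name}({', '.join(sql_args)})"


def select_with_column_args(df, func_name, *sql_args):
    """Select one function call, resolved by the SQL parser instead of the
    typed Python wrapper. PySpark is imported lazily so that sql_call can be
    used and tested without a Spark installation."""
    from pyspark.sql import functions as F

    return df.select(F.expr(sql_call(func_name, *sql_args)))


# Expression strings for two of the functions listed in this issue.
remove_expr = sql_call("array_remove", "col1", "col2")
substring_expr = sql_call("substring", "s", "pos", "len")
```

With a DataFrame df that has columns col1 and col2, select_with_column_args(df, "array_remove", "col1", "col2") is intended to behave like the SQL query shown earlier. The same pattern should apply to array_position, map_contains_key and substring, at the cost of losing the typed Python call.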
      


          People

            ronserruya Ron Serruya