Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-40307 Introduce Arrow Python UDFs
  3. SPARK-43412

Introduce `SQL_ARROW_BATCHED_UDF` EvalType for Arrow Python UDFs

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.5.0
    • 3.5.0
    • Connect, PySpark
    • None

    Description

      We are about to improve nested non-atomic input/output support of an Arrow-optimized Python UDF.

      However, currently, it shares the same EvalType with a pickled Python UDF, but the same implementation with a Pandas UDF.

      Introducing an EvalType enables isolating the changes to Arrow-optimized Python UDFs.

      Attachments

        Activity

          People

            XinrongM Xinrong Meng
            XinrongM Xinrong Meng
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: