Spark / SPARK-45023

SPIP: Python Stored Procedures

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Do
    • Affects Version/s: 4.0.0
    • Fix Version/s: None
    • Component/s: PySpark, SQL
    • Labels: None

    Description

      Stored procedures are an extension of the ANSI SQL standard. They play a crucial role in improving the capabilities of SQL by encapsulating complex logic into reusable routines.

      This proposal aims to extend Spark SQL with support for stored procedures, starting with Python as the procedural language. This addition will allow users to run procedural programs that use Python's programming constructs to carry out tasks with complex logic. Users will also be able to persist these routines in catalogs such as HMS (Hive Metastore) for future reuse. The goal is to let Spark users integrate Python routines seamlessly into their SQL workflows.
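
The idea of a named, catalog-persisted procedural routine can be illustrated with a small pure-Python sketch. This is not the SPIP's design and not a Spark API: `ProcedureCatalog`, `create_procedure`, `call`, and `dedupe_and_count` are hypothetical names invented for illustration, and the "catalog" is an in-memory dictionary rather than HMS.

```python
from typing import Any, Callable, Dict, Iterable, Tuple


class ProcedureCatalog:
    """Toy in-memory stand-in for a catalog of named routines (hypothetical)."""

    def __init__(self) -> None:
        self._procedures: Dict[str, Callable[..., Any]] = {}

    def create_procedure(self, name: str, body: Callable[..., Any]) -> None:
        # Names are matched case-insensitively, as SQL identifiers usually are.
        key = name.lower()
        if key in self._procedures:
            raise ValueError(f"procedure already exists: {name}")
        self._procedures[key] = body

    def call(self, name: str, *args: Any) -> Any:
        # Look up the stored routine by name and run it with the given arguments.
        key = name.lower()
        if key not in self._procedures:
            raise KeyError(f"procedure not found: {name}")
        return self._procedures[key](*args)


def dedupe_and_count(rows: Iterable[Tuple[Any, ...]]) -> int:
    """Procedural logic: drop duplicate rows and return how many remain."""
    seen = set()
    for row in rows:
        seen.add(tuple(row))
    return len(seen)


catalog = ProcedureCatalog()
catalog.create_procedure("dedupe_and_count", dedupe_and_count)

# Reuse the stored routine by name, loosely analogous to a SQL CALL statement.
result = catalog.call("DEDUPE_AND_COUNT", [(1, "a"), (1, "a"), (2, "b")])
print(result)
```

The sketch only shows the encapsulate, register, and call-by-name pattern the description refers to. A real implementation would also have to cover persistence, permissions, and execution inside a Spark session.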

      SPIP: https://docs.google.com/document/d/1ce2EZrf2BxHu7TjfGn4TgToK3TBYYzRkmsIVcfmkNzE/edit?usp=sharing

       

          People

            Unassigned
            Allison Wang (allisonwang-db)
            Hyukjin Kwon
            Votes: 0
            Watchers: 2
