Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-22605

Python UDF should only be created once regardless of how many times it is invoked

    XMLWordPrintableJSON

Details

    Description

      Follow up to http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/I-call-Pandas-UDF-N-times-do-I-have-to-initiate-the-UDF-N-times-tt43576.html.

      Currently, when we call Python UDF N times, the Python UDF is constructed N times. This may become a concern to performance, when we want to load large resources in the open() method of the UDF, which is quite common in machine learning use cases.

      I propose we optimize in PyFlink framework level s.t. no matter how many times a UDF is called in the execution environment, it is only initiated once.

      Attachments

        Activity

          People

            Unassigned Unassigned
            Yik San Chan Yik San Chan
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: