SPARK-48445

Don't inline UDFs with non-cheap children in CollapseProject

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.5.1
    • Fix Version/s: 4.0.0
    • Component/s: SQL

    Description

      CollapseProject.isCheap treats UDFs (and certain other expressions) as cheap, so CollapseProject inlines them and may duplicate them. Duplicating the UDF itself is acceptable, because rules such as ExtractPythonUDFs de-duplicate it later. However, if a UDF has non-cheap child expressions, those children are duplicated as well, which can cause performance regressions.
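
      The effect can be illustrated with a toy model in plain Python. This is only a sketch and not Spark's implementation: the Expr class, the "attr"/"udf"/"expensive" kinds, the simplified collapse rule, and every function name are hypothetical. The "after" check assumes the fix implied by the issue title, namely that a UDF counts as cheap only when all of its children are cheap.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass(frozen=True)
class Expr:
    """Toy expression node; `kind` is one of "attr", "udf", "expensive"."""
    kind: str
    children: Tuple["Expr", ...] = ()


def is_cheap_before(e: Expr) -> bool:
    # Behaviour described in the issue: a UDF counts as cheap
    # no matter what expressions it wraps.
    return e.kind in ("attr", "udf")


def is_cheap_after(e: Expr) -> bool:
    # Assumed fix (from the issue title): a UDF is cheap only
    # if every one of its children is cheap.
    if e.kind == "udf":
        return all(is_cheap_after(c) for c in e.children)
    return e.kind == "attr"


def count_expensive(e: Expr) -> int:
    # Number of "expensive" nodes in the expression tree rooted at e.
    own = 1 if e.kind == "expensive" else 0
    return own + sum(count_expensive(c) for c in e.children)


def expensive_evals(alias_def: Expr, references: int,
                    is_cheap: Callable[[Expr], bool]) -> int:
    # Simplified collapse rule (an assumption): inline the alias into the
    # upper projection when it is referenced at most once or is cheap.
    if references <= 1 or is_cheap(alias_def):
        # Inlined: every reference carries its own copy of the expression.
        return references * count_expensive(alias_def)
    # Not inlined: the lower projection computes the alias once.
    return count_expensive(alias_def)


# Lower projection: x = udf(expensive(a)); upper projection uses x twice.
x = Expr("udf", (Expr("expensive", (Expr("attr"),)),))
before = expensive_evals(x, 2, is_cheap_before)
after = expensive_evals(x, 2, is_cheap_after)
```

      In this model, `before` counts two evaluations of the expensive child because the UDF is inlined into both references, while `after` counts one because the projections stay separate.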

            People

              Assignee: Kelvin Jiang (kelvinjiang)
              Reporter: Kelvin Jiang (kelvinjiang)
              Votes: 0
              Watchers: 2