Spark / SPARK-22979

Avoid per-record type dispatch in Python data conversion (EvaluatePython.fromJava)

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.3.0
    • Fix Version/s: 2.3.0
    • Component/s: PySpark, SQL
    • Labels: None

    Description

      It seems we perform type dispatch for every record when converting Java objects (from Pyrolite) to Spark's internal data format.

      See https://github.com/apache/spark/blob/3f958a99921d149fb9fdf7ba7e78957afdad1405/sql/core/src/main/scala/org/apache/spark/sql/execution/python/EvaluatePython.scala#L89-L162

      It looks like we can create a converter once for each data type and then reuse it for every record.
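
      The idea can be illustrated with a minimal, language-neutral sketch in Python. It is not the Scala code in EvaluatePython.scala: the type names ("int", "double", "string"), the list-of-pairs schema, and every function name below are hypothetical and chosen only for illustration. The first function dispatches on the type for each value; the second resolves the type once and returns a closure that is reused for all records.

```python
from typing import Any, Callable, List, Sequence, Tuple

# Hypothetical stand-in for a schema: (field name, type name) pairs.
Schema = List[Tuple[str, str]]


def convert_per_record(value: Any, type_name: str) -> Any:
    """Pattern to avoid: the type is inspected again for every value."""
    if value is None:
        return None
    if type_name == "int":
        return int(value)
    if type_name == "double":
        return float(value)
    if type_name == "string":
        return str(value)
    raise ValueError(f"unsupported type: {type_name}")


def make_converter(type_name: str) -> Callable[[Any], Any]:
    """Resolve the type once and return a reusable converter closure."""
    if type_name == "int":
        base: Callable[[Any], Any] = int
    elif type_name == "double":
        base = float
    elif type_name == "string":
        base = str
    else:
        raise ValueError(f"unsupported type: {type_name}")

    def convert(value: Any) -> Any:
        # No type dispatch here, only the null check and the conversion.
        return None if value is None else base(value)

    return convert


def make_row_converter(schema: Schema) -> Callable[[Sequence[Any]], tuple]:
    """Build one converter per field up front, then reuse them per row."""
    converters = [make_converter(type_name) for _, type_name in schema]

    def convert_row(row: Sequence[Any]) -> tuple:
        return tuple(conv(value) for conv, value in zip(converters, row))

    return convert_row


schema = [("id", "int"), ("score", "double"), ("name", "string")]
to_row = make_row_converter(schema)  # dispatch happens here, once
rows = [to_row(r) for r in [["1", 2, None], ["7", "3.5", 42]]]
```

      In this sketch the cost of choosing a conversion is paid once per field when the row converter is built, instead of once per value as in convert_per_record.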

          People

            Assignee: gurwls223 Hyukjin Kwon
            Reporter: gurwls223 Hyukjin Kwon
