Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-36338

Move distributed-sequence implementation to Scala side

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 3.2.0
    • 3.2.0
    • PySpark
    • None

    Description

      https://github.com/apache/spark/blob/c22f7a4834e6fb7b69c4cc4af87c61c2fbbe0786/python/pyspark/pandas/internal.py#L925-L945

      This can be implemented in JVM side to make it more performance without extra serializations, and working around the nullability.

      Attachments

        Activity

          People

            gurwls223 Hyukjin Kwon
            gurwls223 Hyukjin Kwon
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: