Spark / SPARK-3491

Use pickle to serialize the data in MLlib Python

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.2.0
    • Component/s: MLlib, PySpark
    • Labels: None

    Description

      Currently, we write the serialization/deserialization code by hand in both Python and Scala, which does not scale to the large number of MLlib APIs.

      If serialization were done with pickle (using Pyrolite on the JVM) in an extensible way, it would be much easier to add Python APIs for MLlib.
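
      To illustrate what "extensible" could mean on the Python side, here is a minimal sketch, not the code from the actual patch. The classes `DenseVector` and `LabeledPoint` are hypothetical stand-ins rather than the real pyspark.mllib types, and `roundtrip` is a helper invented for the example. Each type declares how it is rebuilt via `__reduce__`, so supporting a new type needs no new hand-written wire format. The JVM side would presumably need a matching constructor registered with Pyrolite for each class name; that part is assumed and not shown.

```python
import pickle


class DenseVector(object):
    """Hypothetical stand-in for an MLlib vector (not the real pyspark class)."""

    def __init__(self, values):
        self.values = [float(v) for v in values]

    def __reduce__(self):
        # Tell pickle to rebuild this object as DenseVector(values).
        return (DenseVector, (self.values,))

    def __eq__(self, other):
        return isinstance(other, DenseVector) and self.values == other.values


class LabeledPoint(object):
    """Hypothetical stand-in: a label plus a feature vector."""

    def __init__(self, label, features):
        self.label = float(label)
        self.features = features

    def __reduce__(self):
        # The nested DenseVector is pickled through its own __reduce__,
        # so composite types need no extra serialization code.
        return (LabeledPoint, (self.label, self.features))

    def __eq__(self, other):
        return (isinstance(other, LabeledPoint)
                and self.label == other.label
                and self.features == other.features)


def roundtrip(obj):
    # Protocol 2 is an assumption here, chosen as a widely supported
    # binary pickle protocol.
    return pickle.loads(pickle.dumps(obj, protocol=2))
```

      With this approach, adding another type only requires one more `__reduce__` method (plus the corresponding registration on the JVM side), instead of a new pair of manual encoders and decoders.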


            People

              Assignee: Davies Liu (davies)
              Reporter: Davies Liu (davies)
              Votes: 0
              Watchers: 3
