[SPARK-21542] Helper functions for custom Python Persistence - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.2.0
Fix Version/s: 2.3.0
Component/s: ML, PySpark
Labels:
None

Description

Currently, there is no way to easily persist Json-serializable parameters in Python only. All parameters in Python are persisted by converting them to Java objects and using the Java persistence implementation. In order to facilitate the creation of custom Python-only pipeline stages, it would be good to have a Python-only persistence framework so that these stages do not need to be implemented in Scala for persistence.

This task involves:

Adding implementations for DefaultParamsReadable, DefaultParamsWriteable, DefaultParamsReader, and DefaultParamsWriter in pyspark.

Attachments

Issue Links

relates to

SPARK-17025 Cannot persist PySpark ML Pipeline model that includes custom Transformer

Resolved

links to

PR 18742

Activity

People

Assignee:: Ajay Saini

Reporter:: Ajay Saini

Shepherd:: Joseph K. Bradley

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 26/Jul/17 23:04

Updated:: 09/Nov/18 20:07

Resolved:: 08/Aug/17 00:04