Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-15571

Pipeline unit test improvements

    XMLWordPrintableJSON

Details

    • Test
    • Status: Resolved
    • Major
    • Resolution: Incomplete
    • None
    • None
    • ML, PySpark

    Description

      Issue:

      • There are several pieces of standard functionality shared by all algorithms: Params, UIDs, fit/transform/save/load, etc. Currently, these pieces are generally tested in ad hoc tests for each algorithm.
      • This has led to inconsistent coverage, especially within the Python API.

      Goal:

      • Standardize unit tests for Scala and Python to improve and consolidate test coverage for Params, persistence, and other common functionality.
      • Eliminate duplicate code. Improve test coverage. Simplify adding these standard unit tests for future algorithms and APIs.

      This will require several subtasks. If you identify an issue, please create a subtask, or comment below if the issue needs to be discussed first.

      Attachments

        Activity

          People

            Unassigned Unassigned
            josephkb Joseph K. Bradley
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: