Spark / SPARK-7653

ML Pipeline and meta-algs should take random seed param

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: ML

    Description

      ML Pipelines and other meta-algorithms should implement HasSeed. If the seed is set, the meta-algorithm uses it to generate a new seed for each component PipelineStage.

      • Note: This may require a little discussion about whether HasSeed should be a public API.

      This will make it easier for users to have reproducible results for entire pipelines (rather than setting the seed for each stage manually).
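
      As a rough illustration of the proposal, the sketch below derives one seed per stage from a single pipeline-level seed. It is plain Python, not Spark code: Stage, SeededPipeline, set_seed and propagate_seeds are hypothetical names invented here, and drawing 63-bit values from random.Random is an assumption about how the derivation could work, not something the issue specifies.

```python
import random
from typing import List, Optional


class Stage:
    """Hypothetical stand-in for a PipelineStage that accepts a seed."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.seed: Optional[int] = None

    def set_seed(self, seed: int) -> "Stage":
        self.seed = seed
        return self


class SeededPipeline:
    """Hypothetical meta-algorithm that owns a seed and a list of stages."""

    def __init__(self, stages: List[Stage], seed: Optional[int] = None) -> None:
        self.stages = stages
        self.seed = seed

    def propagate_seeds(self) -> List[Optional[int]]:
        """Derive a seed for every stage from the pipeline seed, if one is set."""
        if self.seed is not None:
            # One generator seeded once; each stage gets the next draw,
            # so the same pipeline seed always yields the same stage seeds.
            rng = random.Random(self.seed)
            for stage in self.stages:
                # Assumption: 63 bits, so the value fits a signed 64-bit integer.
                stage.set_seed(rng.getrandbits(63))
        # If no pipeline seed is set, stage seeds are left untouched.
        return [stage.seed for stage in self.stages]


if __name__ == "__main__":
    pipeline = SeededPipeline([Stage("tokenizer"), Stage("lr")], seed=42)
    for stage, seed in zip(pipeline.stages, pipeline.propagate_seeds()):
        print(stage.name, seed)
```

      In this sketch, a seed set manually on a stage is overwritten whenever the pipeline seed is set. Whether the pipeline seed should take precedence is one of the design questions the issue leaves open.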

          People

            Assignee: Unassigned
            Reporter: Joseph K. Bradley (josephkb)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated: 1h
                Remaining: 1h
                Logged: Not Specified