Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-13030

Change OneHotEncoder to Estimator

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.6.0
    • 2.3.0
    • ML
    • None

    Description

      OneHotEncoder should be an Estimator, just like in scikit-learn (http://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html).
      In its current form, it is impossible to use when number of categories is different between training dataset and test dataset.

      Attachments

        Issue Links

          Activity

            People

              viirya L. C. Hsieh
              wjur Wojciech Jurczyk
              Votes:
              3 Vote for this issue
              Watchers:
              14 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: