Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-14550

OneHotEncoding wrapper in SparkR

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Later
    • None
    • None
    • ML, SparkR
    • None

    Description

      Implement OneHotEncoding in R.

      In R , usually one can use model.matrix to do one hot encoding. which accepts formula. I think we can support simple formula here.

      model.matrix doc: https://stat.ethz.ch/R-manual/R-devel/library/stats/html/model.matrix.html

      here is the example, that would be nice to have
      example :
      http://stackoverflow.com/questions/16200241/recode-categorical-factor-with-n-categories-into-n-binary-columns

      Attachments

        Activity

          People

            Unassigned Unassigned
            aloknsingh Alok Singh
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: