Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-9647 MLlib + SparkR integration for 1.6
  3. SPARK-11349

Support transform string label for RFormula

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.6.0
    • ML
    • None

    Description

      Currently RFormula can only handle label with NumericType or BinaryType (cast it to DoubleType as the label of Linear Regression training), we should also support label of StringType which is needed for Logistic Regression (glm with family = "binomial").
      For label of StringType, we should use StringIndexer to transform it to 0-based index.

      Attachments

        Issue Links

          Activity

            People

              yanboliang Yanbo Liang
              yanboliang Yanbo Liang
              Xiangrui Meng Xiangrui Meng
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: