Description
Currently RFormula can only handle label with NumericType or BinaryType (cast it to DoubleType as the label of Linear Regression training), we should also support label of StringType which is needed for Logistic Regression (glm with family = "binomial").
For label of StringType, we should use StringIndexer to transform it to 0-based index.
Attachments
Issue Links
- is depended upon by
-
SPARK-9492 LogisticRegression in R should provide model statistics
- Resolved
- is duplicated by
-
SPARK-10523 SparkR formula syntax to turn strings/factors into numerics
- Resolved
- links to