Description
Currently, Spark supports feature selection only for categorical features, although there is considerable demand for selecting continuous features.
The ANOVA F-value is one way to select continuous features, so it is important to support it in Spark.
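To illustrate the request, here is a minimal sketch of ANOVA F-value scoring in plain Python. It is not Spark's implementation and does not use any Spark API. The function names `anova_f_value` and `select_top_features` and the parameter `num_top` are hypothetical, chosen only for this example. The sketch assumes continuous features and a categorical label, and it scores each feature by the ratio of between-class variance to within-class variance.

```python
from statistics import mean


def anova_f_value(values, labels):
    """One-way ANOVA F-statistic of one continuous feature against a categorical label."""
    # Group the feature values by class label.
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(y, []).append(v)

    n, k = len(values), len(groups)
    if k < 2 or n <= k:
        raise ValueError("need at least two classes and more samples than classes")

    grand_mean = mean(values)
    # Variation of the class means around the overall mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups.values())
    # Variation of the samples around their own class mean.
    ss_within = sum((v - mean(g)) ** 2 for g in groups.values() for v in g)

    if ss_within == 0:
        # Degenerate case: no spread inside any class.
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def select_top_features(rows, labels, num_top):
    """Indices of the num_top features with the largest F-value, in ascending order."""
    num_features = len(rows[0])
    scores = [
        anova_f_value([row[j] for row in rows], labels) for j in range(num_features)
    ]
    ranked = sorted(range(num_features), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:num_top])
```

A feature whose class means are far apart relative to the spread inside each class gets a large F-value and is kept; a feature whose class means coincide scores 0 and is dropped first.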
Issue Links
- is related to: SPARK-34080 Add UnivariateFeatureSelector to deprecate existing selectors (Resolved)
1. Support FValueRegressionSelector for continuous features and continuous labels | Resolved | Huaxin Gao
2. add FValueRegressionTest | Resolved | Huaxin Gao
3. Remove ChiSqSelector dependency on mllib.ChiSqSelectorModel | Resolved | Huaxin Gao
4. Add abstract Selector | Resolved | Huaxin Gao
5. Add ANOVASelector | Resolved | Huaxin Gao
6. add ANOVATest and FValueTest to PySpark | Resolved | Huaxin Gao
7. Add ANOVATest example | Resolved | kevin yu
8. Add FValueTest Examples | Resolved | kevin yu
9. Add ANOVASelector and FValueSelector to PySpark | Resolved | Huaxin Gao
10. Add docs and examples for ANOVASelector and FValueSelector | Resolved | Huaxin Gao