Details
- Type: New Feature
- Status: Resolved
- Priority: Major
- Resolution: Won't Fix
Description
Some time ago, we discussed with Ameet Talwalkar the possibility of including both Feature Selection and Discretization algorithms in MLlib.
In this patch we've implemented Entropy Minimization Discretization following the algorithm described in the paper "Multi-interval discretization of continuous-valued attributes for classification learning" by Fayyad and Irani (1993). This is one of the most widely used discretizers and is already included in libraries such as Weka. It can serve as a base for feature selection algorithms and for the NaiveBayes implementation already included in MLlib.
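For readers unfamiliar with the method, here is a minimal single-machine sketch of the Fayyad and Irani procedure for one continuous attribute, in plain Python. It is not the code from this patch and does not use Spark. The function names (`entropy`, `best_cut`, `mdlp_accepts`, `mdlp_cut_points`) are hypothetical, and two simplifications are assumed: every position between distinct values is tried as a candidate cut (rather than only class-boundary points), and the cut value is placed at the midpoint of the two neighbouring values.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def best_cut(pairs):
    """Return (gain, index) of the split with the highest information gain.

    `pairs` is a list of (value, label) sorted by value. Returns None when
    all values are equal, so no cut is possible.
    """
    n = len(pairs)
    labels = [label for _, label in pairs]
    base = entropy(labels)
    best = None
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot cut between two identical values
        weighted = (i / n) * entropy(labels[:i]) + ((n - i) / n) * entropy(labels[i:])
        gain = base - weighted
        if best is None or gain > best[0]:
            best = (gain, i)
    return best


def mdlp_accepts(labels, i, gain):
    """MDLP stopping criterion: accept the cut only if the gain pays for its cost."""
    n = len(labels)
    left, right = labels[:i], labels[i:]
    k, k1, k2 = len(set(labels)), len(set(left)), len(set(right))
    delta = log2(3 ** k - 2) - (
        k * entropy(labels) - k1 * entropy(left) - k2 * entropy(right)
    )
    return gain > (log2(n - 1) + delta) / n


def mdlp_cut_points(values, labels):
    """Recursively split one attribute; return the sorted list of cut points."""
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    cuts = []

    def recurse(segment):
        if len(segment) < 2:
            return
        found = best_cut(segment)
        if found is None:
            return
        gain, i = found
        if not mdlp_accepts([label for _, label in segment], i, gain):
            return
        # Assumption: place the cut halfway between the neighbouring values.
        cuts.append((segment[i - 1][0] + segment[i][0]) / 2)
        recurse(segment[:i])
        recurse(segment[i:])

    recurse(pairs)
    return sorted(cuts)
```

For example, `mdlp_cut_points([1, 2, 3, 4, 10, 11, 12, 13], [0, 0, 0, 0, 1, 1, 1, 1])` yields a single cut between 4 and 10, while an attribute whose examples all share one class yields no cuts. A distributed version would need to compute the per-candidate class counts across partitions, which this sketch does not attempt.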
Issue Links
- contains: SPARK-6509 MDLP discretizer (Resolved)
- links to