Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-22586

Feature selection

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Invalid
    • 2.2.0
    • None
    • ML
    • None

    Description

      Hello everyone,

      I would like to know if there are plans to add different score functions to perform feature selection under the same interface. I saw two previous issues related to the topic:

      https://issues.apache.org/jira/browse/SPARK-6531
      https://issues.apache.org/jira/browse/SPARK-1473

      However, it seems nothing was added at the end. I would like to know if there was some problem then, because I wouldn't mind taking a closer look to it in case people would be interested.

      Additionally, I think it would be interested to include a score metric between continuous attributes (for regression), and between continuous and discrete (for classification). This has already been done successfully on http://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection

      Attachments

        Activity

          People

            Unassigned Unassigned
            jorge.glez.lopez Jorge Gonzalez Lopez
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: