Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-38083

set the amount of explained variance as parameter of pyspark.ml.feature.PCA

    XMLWordPrintableJSON

Details

    • Wish
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.2.2
    • None
    • ML, MLlib
    • None

    Description

      As in sklearn.decomposition.PCA, where:

      if 0 < n_components < 1 select the number of components such that the amount of variance that needs to be explained is greater than the percentage specified by n_components

      it would be useful to have a similar behavior with the k parameter in pyspark.ml.feature.PCA.

      Attachments

        Activity

          People

            Unassigned Unassigned
            nicpado Nicola
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: