Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-31492

flatten the result dataframe of FValueTest

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 3.1.0
    • 3.1.0
    • ML
    • None

    Description

      add a new method

      @Since("3.1.0")
      def test(
          dataset: DataFrame,
          featuresCol: String,
          labelCol: String,
          flatten: Boolean): DataFrame 

       

      Similar to new test method in ChiSquareTest, it will:

      1, support df operation on the returned df;

      2, make driver no longer a bottleneck when dim is high

       

       

      Attachments

        Issue Links

          Activity

            People

              podongfeng Ruifeng Zheng
              podongfeng Ruifeng Zheng
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: