Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-23483 Feature parity for Python vs Scala APIs
  3. SPARK-23615

Add maxDF Parameter to Python CountVectorizer

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 2.4.0
    • 2.4.0
    • ML, PySpark
    • None

    Description

      The maxDF parameter is for filtering out frequently occurring terms. This param was recently added to the Scala CountVectorizer and needs to be added to Python also.

      Attachments

        Issue Links

          Activity

            People

              huaxing Huaxin Gao
              bryanc Bryan Cutler
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: