Description
StopWordsRemover takes a string array column as input and outputs a string array column with every configured stop word removed. The transformer should also ship with a standard set of stop words that is used by default.
val stopWords = new StopWordsRemover()
  .setInputCol("words")
  .setOutputCol("cleanWords")
  .setStopWords(Array(...)) // optional
val output = stopWords.transform(df)
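To make the intended per-row behavior concrete, here is a minimal, dependency-free sketch of the filtering step. It is not the Spark implementation: the object name `StopWordsSketch`, the helper `removeStopWords`, the `caseSensitive` flag, and the tiny default word list are all assumptions made for illustration, since the issue does not specify matching rules or the actual default list.

```scala
object StopWordsSketch {
  // Hypothetical default list, for illustration only; the issue does not
  // say which words the standard set should contain.
  val defaultStopWords: Set[String] = Set("a", "an", "the", "is", "of")

  // Returns the tokens of one row with every stop word dropped, preserving order.
  // Case-insensitive matching by default is an assumption of this sketch.
  def removeStopWords(
      words: Seq[String],
      stopWords: Set[String] = defaultStopWords,
      caseSensitive: Boolean = false): Seq[String] = {
    if (caseSensitive) {
      words.filterNot(stopWords.contains)
    } else {
      val lowered = stopWords.map(_.toLowerCase)
      words.filterNot(w => lowered.contains(w.toLowerCase))
    }
  }

  def main(args: Array[String]): Unit = {
    // Each inner Seq stands in for one row of the "words" column.
    val rows = Seq(
      Seq("The", "quick", "fox"),
      Seq("an", "example", "of", "text"))
    rows.map(removeStopWords(_)).foreach(r => println(r.mkString(" ")))
    // prints:
    // quick fox
    // example text
  }
}
```

A transformer along these lines would apply the same function to each row of the input column; passing a custom set corresponds to the optional setStopWords call above.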
Issue Links
- is related to
  - SPARK-9679 Add python interface for ml.feature.StopWordsRemover (Resolved)
  - SPARK-9680 Update programming guide section for ml.feature.StopWordsRemover (Resolved)
- is required by
  - SPARK-5571 LDA should handle text as well (Resolved)
- relates to
  - SPARK-9892 User Guide for StopWordsRemover (Closed)