Details
Description
pyspark.mllib.clustering.LDAModel has no way to estimate the topic distribution for new documents, although this functionality already exists on the Scala side in org.apache.spark.mllib.clustering.LDAModel. Exposing it in Python should only require setting up the API calls that forward to the existing implementation. I have forked the Spark repo and implemented the changes locally.
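To illustrate what "setting up the API calls" might look like, here is a minimal sketch of the forwarding pattern, not the actual patch. It assumes the Python model simply delegates a `topicDistributions` call to the JVM-side model by name. `FakeJavaModel` and `LDAModelSketch` are hypothetical names introduced for illustration, and the stub returns uniform distributions so the example runs without Spark.

```python
class FakeJavaModel:
    """Hypothetical stand-in for the JVM-side LDA model (assumption, not Spark code)."""

    def __init__(self, k):
        self.k = k  # number of topics

    def topicDistributions(self, documents):
        # A real model would run inference; this stub returns a uniform
        # distribution over k topics for every (doc_id, term_counts) pair.
        return [(doc_id, [1.0 / self.k] * self.k) for doc_id, _ in documents]


class LDAModelSketch:
    """Hypothetical Python-side wrapper that forwards calls to the wrapped model."""

    def __init__(self, java_model):
        self._java_model = java_model

    def call(self, name, *args):
        # Look up the method on the wrapped model by name and invoke it.
        return getattr(self._java_model, name)(*args)

    def topicDistributions(self, documents):
        """Return (doc_id, topic_distribution) pairs for new documents."""
        return self.call("topicDistributions", documents)


# Usage: two new documents given as (id, term-count vector) pairs.
model = LDAModelSketch(FakeJavaModel(k=4))
result = model.topicDistributions([(0, [1, 0, 2]), (1, [0, 3, 0])])
```

In a real change the wrapped object would be the JVM model and the documents would be an RDD rather than a Python list; the point here is only that the Python method is a thin pass-through.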
Attachments
Issue Links
- depends upon: SPARK-5567 Add prediction methods to LDA (Resolved)
- links to