Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Won't Fix
-
2.3.0
-
None
-
None
Description
OnlineLDAOptimizer should filter out empty documents beforehand in order to make corpusSize, batchSize, and nonEmptyDocsN all refer to the same filtered corpus with all non-empty docs.
Attachments
Issue Links
- is blocked by
-
SPARK-14371 OnlineLDAOptimizer should not collect stats for each doc in mini-batch to driver
- Resolved
- links to