SPARK-5562: LDA should handle empty documents

Details

    • Type: Test
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.3.0
    • Fix Version/s: 1.5.0
    • Component/s: MLlib

    Description

      Latent Dirichlet Allocation (LDA) can easily be given empty documents when users select a small vocabulary, because a document that contains no in-vocabulary terms is left with no tokens. We should verify that LDA is robust to empty documents.

      This will hopefully take the form of a unit test, but may require modifying the LDA implementation.
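
      As an illustration of how a small vocabulary produces empty documents, here is a minimal, Spark-free sketch in Python. It is not the MLlib implementation or the test added for this issue; the helper names (build_vocabulary, to_count_vector, empty_document_ids) and the toy corpus are hypothetical, chosen only to show the situation the description refers to.

```python
from collections import Counter


def build_vocabulary(tokenized_docs, vocab_size):
    """Keep the vocab_size most frequent terms (ties broken alphabetically)."""
    counts = Counter(term for doc in tokenized_docs for term in doc)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {term: index for index, (term, _) in enumerate(ranked[:vocab_size])}


def to_count_vector(doc, vocabulary):
    """Turn a tokenized document into a dense term-count vector."""
    vector = [0] * len(vocabulary)
    for term in doc:
        index = vocabulary.get(term)
        if index is not None:  # out-of-vocabulary terms are dropped
            vector[index] += 1
    return vector


def empty_document_ids(count_vectors):
    """Return the ids of documents whose count vector is all zeros."""
    return [doc_id for doc_id, vec in enumerate(count_vectors) if sum(vec) == 0]


# Hypothetical toy corpus: the third document uses only rare terms,
# and the fourth is empty before any vocabulary is applied.
corpus = [
    ["spark", "lda", "spark", "topic"],
    ["lda", "topic", "spark"],
    ["rare", "words", "only"],
    [],
]

vocabulary = build_vocabulary(corpus, vocab_size=3)
count_vectors = [to_count_vector(doc, vocabulary) for doc in corpus]
empty_ids = empty_document_ids(count_vectors)
```

      With a vocabulary of three terms, the third document loses all of its tokens and becomes indistinguishable from the fourth. A robustness test for LDA would feed such all-zero vectors to the model alongside non-empty ones and check that training completes.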

            People

              Assignee: Alok Singh (aloknsingh)
              Reporter: Joseph K. Bradley (josephkb)
              Shepherd: Joseph K. Bradley
              Votes: 0
              Watchers: 4

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated: 96h
                  Remaining: 96h
                  Logged: Not Specified