Spark / SPARK-1655

In naive Bayes, store conditional probabilities distributively.

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Auto Closed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: MLlib

    Description

      The current implementation collects all conditional probabilities on the driver node. When there are many labels and many features, this table (one entry per label-feature pair) puts a heavy load on the driver. For scalability, we should provide a way to store the conditional probabilities in a distributed fashion.
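To make the proposal concrete, here is a minimal sketch in plain Python, not Spark's implementation and not a design taken from this issue. It assumes a multinomial naive Bayes model whose conditional log-probabilities are sharded by feature index, so that no single node holds the full label-by-feature table. The names `shard_by_feature`, `partial_scores`, `predict`, `LOG_THETA`, and `LOG_PRIOR` are hypothetical, and the list of shards stands in for partitions of a distributed dataset.

```python
import math
from collections import defaultdict

def shard_by_feature(log_theta, num_shards):
    """Split a {(label, feature): log_prob} table into shards keyed by feature index."""
    shards = [dict() for _ in range(num_shards)]
    for (label, feature), log_prob in log_theta.items():
        shards[feature % num_shards][(label, feature)] = log_prob
    return shards

def partial_scores(shard, features):
    """Per-label partial log-likelihood computed from one shard's entries only."""
    scores = defaultdict(float)
    for (label, feature), log_prob in shard.items():
        count = features.get(feature, 0)
        if count:
            scores[label] += count * log_prob
    return scores

def predict(log_prior, shards, features):
    """Sum the per-shard partial scores, add the log prior, and return the best label."""
    total = dict(log_prior)
    for shard in shards:  # in a cluster, each shard would be scored where it is stored
        for label, score in partial_scores(shard, features).items():
            total[label] += score
    return max(total, key=total.get)

# Toy model (assumed values): 2 labels, 3 features.
LOG_THETA = {
    (0, 0): math.log(0.7), (0, 1): math.log(0.2), (0, 2): math.log(0.1),
    (1, 0): math.log(0.1), (1, 1): math.log(0.2), (1, 2): math.log(0.7),
}
LOG_PRIOR = {0: math.log(0.5), 1: math.log(0.5)}

shards = shard_by_feature(LOG_THETA, num_shards=2)
label = predict(LOG_PRIOR, shards, {0: 3, 2: 1})  # feature index -> count
```

Under this layout, only one small per-label score vector per shard has to be combined at prediction time, while the conditional probability table itself stays spread across the shards. Sharding by label instead of by feature would be an equally plausible choice; the issue does not specify one.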

          People

            Assignee: Aaron Staple (staple)
            Reporter: Xiangrui Meng (mengxr)
            Votes: 1
            Watchers: 5
