SPARK-1655

In naive Bayes, store conditional probabilities distributively.

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Auto Closed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: MLlib
    • Labels:

      Description

      In the current implementation, we collect all conditional probabilities to the driver node. With many labels and many features, the number of conditional probabilities (one per label-feature pair) becomes large, which puts a heavy load on the driver. For scalability, we should provide a way to store the conditional probabilities in a distributed fashion.
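
The idea can be illustrated with a minimal sketch in plain Python. It does not use Spark and is not the approach taken in this issue; it only shows one possible layout, assuming a multinomial naive Bayes model with additive smoothing. The names `train_partitioned` and `predict`, the list-of-dicts stand-in for partitions, and the hash-by-label placement are all hypothetical. Each simulated partition holds the conditional log-probabilities for a subset of labels, and prediction combines only one small (score, label) pair per partition. For brevity, the counting step here still runs in one process; only the storage and prediction layout is the point.

```python
import math
from collections import defaultdict


def train_partitioned(samples, num_features, num_partitions, smoothing=1.0):
    """Build per-label log-probabilities and spread them over partitions.

    samples: iterable of (int_label, feature_counts) pairs.
    Returns a list of dicts; partitions[p][label] = (log_prior, log_theta_row).
    """
    feature_counts = defaultdict(lambda: [0.0] * num_features)
    doc_counts = defaultdict(int)
    for label, features in samples:
        doc_counts[label] += 1
        row = feature_counts[label]
        for j, value in enumerate(features):
            row[j] += value

    total_docs = sum(doc_counts.values())
    num_labels = len(doc_counts)
    partitions = [dict() for _ in range(num_partitions)]
    for label, row in feature_counts.items():
        log_prior = math.log(
            (doc_counts[label] + smoothing) / (total_docs + smoothing * num_labels)
        )
        denom = sum(row) + smoothing * num_features
        log_theta = [math.log((c + smoothing) / denom) for c in row]
        # Assumption: labels are ints, so a simple modulo decides placement.
        partitions[label % num_partitions][label] = (log_prior, log_theta)
    return partitions


def predict(partitions, features):
    """Score labels partition by partition; combine only the local winners."""
    local_best = []
    for part in partitions:
        best = None
        for label, (log_prior, log_theta) in part.items():
            score = log_prior + sum(x * lt for x, lt in zip(features, log_theta))
            if best is None or score > best[0]:
                best = (score, label)
        if best is not None:
            local_best.append(best)  # one small pair per partition
    return max(local_best)[1]


# Toy data: three labels, three features, two simulated partitions.
samples = [
    (0, [3, 0, 0]), (0, [2, 1, 0]),
    (1, [0, 0, 4]), (1, [0, 1, 3]),
    (2, [0, 5, 0]),
]
model = train_partitioned(samples, num_features=3, num_partitions=2)
predicted = predict(model, [4, 0, 0])
```

In this layout no single node ever needs the full label-by-feature table: each partition scores only its own labels, and the final step compares one candidate per partition.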

            People

            • Assignee: Aaron Staple (staple)
            • Reporter: Xiangrui Meng (mengxr)
            • Votes: 1
            • Watchers: 5
