Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-15510

SparkR NaiveBayes should not require label to have NominalAttribute

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • None
    • None
    • ML, SparkR
    • None

    Description

      Currently, SparkR's NaiveBayes API fails if the label is numeric. It works if the label is a String. This is because NaiveBayesWrapper requires that the input column be annotated with NominalAttribute, which is created when Strings are indexed by RFormula. We should eliminate this restriction since it causes failures easily, such as when trying to run NB on LibSVM datasets.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              josephkb Joseph K. Bradley
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: