Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-10595

Various ML programming guide cleanups post 1.5

    XMLWordPrintableJSON

    Details

    • Type: Documentation
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.5.0
    • Fix Version/s: 1.6.0
    • Component/s: Documentation, ML, MLlib
    • Labels:
      None

      Description

      Various ML guide cleanups.

      • ml-guide.md: Make it easier to access the algorithm-specific guides.
      • LDA user guide: EM often begins with useless topics, but running longer generally improves them dramatically. E.g., 10 iterations on a Wikipedia dataset produces useless topics, but 50 iterations produces very meaningful topics.
      • mllib-feature-extraction.html#elementwiseproduct: “w” parameter should be “scalingVec”
      • Clean up Binarizer user guide a little.
      • Document in Pipeline that users should not put an instance into the Pipeline in more than 1 place.
      • spark.ml Word2Vec user guide: clean up grammar/writing
      • Chi Sq Feature Selector docs: Improve text in doc.

        Attachments

          Activity

            People

            • Assignee:
              josephkb Joseph K. Bradley
              Reporter:
              josephkb Joseph K. Bradley
              Shepherd:
              Feynman Liang
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: