Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-14808 Spark MLlib, GraphX, SparkR 2.0 QA umbrella
  3. SPARK-14817

ML, Graph, R 2.0 QA: Programming guide update and migration guide

    XMLWordPrintableJSON

Details

    Description

      Before the release, we need to update the MLlib, GraphX, and SparkR Programming Guides. Updates will include:

      • Add migration guide subsection.
        • Use the results of the QA audit JIRAs and SPARK-13448.
      • Check phrasing, especially in main sections (for outdated items such as "In this release, ...")

      For MLlib, we will make the DataFrame-based API (spark.ml) front-and-center, to make it clear the RDD-based API is the older, maintenance-mode one.

      • No docs for spark.mllib will be deleted; they will just be reorganized and put in a subsection.
      • If spark.ml docs are less complete, or if spark.ml docs say "refer to the spark.mllib docs for details," then we should copy those details to the spark.ml docs. This per-feature work can happen under SPARK-14815.
      • This big reorganization should be done after docs are added for each feature (to minimize merge conflicts).

      Attachments

        Issue Links

          Activity

            People

              josephkb Joseph K. Bradley
              josephkb Joseph K. Bradley
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: