Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-7751

Add @Since annotation to stable and experimental methods in MLlib

    XMLWordPrintableJSON

Details

    • Umbrella
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 2.0.0
    • Documentation, MLlib

    Description

      This is useful to check whether a feature exists in some version of Spark. This is an umbrella JIRA to track the progress. We want to have @since tag @Since annotation for both stable (those without any Experimental/DeveloperApi/AlphaComponent annotations) and experimental methods in MLlib:

      (Do NOT tag private or package private classes or methods, nor local variables and methods.)

      We need to dig the history of git commit to figure out what was the Spark version when a method was first introduced. Take `NaiveBayes.setModelType` as an example. We can grep `def setModelType` at different version git tags.

      meng@xm:~/src/spark
      $ git show v1.3.0:mllib/src/main/scala/org/apache/spark/mllib/classification/NaiveBayes.scala | grep "def setModelType"
      meng@xm:~/src/spark
      $ git show v1.4.0:mllib/src/main/scala/org/apache/spark/mllib/classification/NaiveBayes.scala | grep "def setModelType"
        def setModelType(modelType: String): NaiveBayes = {
      

      If there are better ways, please let us know.

      We cannot add all @since tags @Since annotation in a single PR, which is hard to review. So we made some subtasks for each package, for example `org.apache.spark.classification`. Feel free to add more sub-tasks for Python and the `spark.ml` package.

      Plan:

      1. In 1.5, we try to add @Since annotation to all stable/experimental methods under `spark.mllib`.
      2. Starting from 1.6, we require @Since annotation in all new PRs.
      3. In 1.6, we try to add @SInce annotation to all stable/experimental methods under `spark.ml`, `pyspark.mllib`, and `pyspark.ml`.

      Attachments

        Issue Links

          1.
          Add @since tags to mllib.classification Sub-task Resolved Patrick Baier

          0%

          Original Estimate - 1h
          Remaining Estimate - 1h
          2.
          Add @since tags to mllib.regression Sub-task Resolved Prayag Chandran Nirmala

          0%

          Original Estimate - 1h
          Remaining Estimate - 1h
          3.
          Add @since tags to mllib.linalg Sub-task Resolved Unassigned

          0%

          Original Estimate - 2h
          Remaining Estimate - 2h
          4.
          Add @since tags to mllib.clustering Sub-task Resolved Xiangrui Meng

          0%

          Original Estimate - 2h
          Remaining Estimate - 2h
          5.
          Add @since tags to mllib.recommendation Sub-task Resolved Vinod KC

          0%

          Original Estimate - 1h
          Remaining Estimate - 1h
          6.
          Add @since tags to mllib.linalg Sub-task Resolved Sameer Abhyankar

          0%

          Original Estimate - 4h
          Remaining Estimate - 4h
          7.
          Add @since tags to mllib.stat Sub-task Resolved bimal tandel

          0%

          Original Estimate - 2h
          Remaining Estimate - 2h
          8.
          Add @since tags to mllib.evaluation Sub-task Resolved Xiangrui Meng

          0%

          Original Estimate - 1h
          Remaining Estimate - 1h
          9.
          Add @since tags to mllib.fpm Sub-task Resolved Rahul Palamuttam

          0%

          Original Estimate - 0.5h
          Remaining Estimate - 0.5h
          10.
          Add @since tags to mllib.tree Sub-task Resolved Bryan Cutler

          0%

          Original Estimate - 4h
          Remaining Estimate - 4h
          11.
          Add @since tags to mllib.util Sub-task Resolved Sudhakar Thota

          0%

          Original Estimate - 1h
          Remaining Estimate - 1h
          12.
          Add @Since annotation to new public APIs in 1.5 Sub-task Resolved Manoj Kumar  
          13.
          Add @Since annotation to mllib.feature Sub-task Resolved Manoj Kumar  
          14.
          Update @Since annotation for mllib.classification Sub-task Resolved Xiangrui Meng  
          15.
          Update @Since annotation for mllib.evaluation Sub-task Resolved Xiangrui Meng  
          16.
          Update @Since annotation for mllib.clustering Sub-task Resolved Xiangrui Meng  
          17.
          Update @Since annotation for mllib.regression Sub-task Resolved Xiangrui Meng  
          18.
          Update @Since annotation for mllib.feature Sub-task Resolved Xiangrui Meng  
          19.
          Update @Since annotation for mllib.fpm Sub-task Resolved Xiangrui Meng  
          20.
          Update @Since annotation for mllib.linalg Sub-task Resolved Xiangrui Meng  
          21.
          Update @Since annotation for mllib.pmml Sub-task Resolved Xiangrui Meng  
          22.
          Update @Since annotation for mllib.random Sub-task Resolved Xiangrui Meng  
          23.
          Update @Since annotation for mllib.recommendation Sub-task Resolved Xiangrui Meng  
          24.
          Update @Since annotation for mllib.stat Sub-task Resolved Xiangrui Meng  
          25.
          Update @Since annotation for mllib.tree Sub-task Resolved Xiangrui Meng  
          26.
          Update @Since annotation for mllib.util Sub-task Resolved Xiangrui Meng  
          27.
          Add @Since annotation to ml.feature Sub-task Resolved Martin Brown  
          28.
          Add @Since annotation to ml.classification Sub-task Resolved Hiroshi Takahashi  
          29.
          Add @Since annotation to ml.clustering Sub-task Resolved Yu Ishikawa  
          30.
          Add @Since annotation to ml.evaluation Sub-task Resolved Tijo Thomas  
          31.
          Add @Since annotation to ml.attribute Sub-task Resolved Unassigned  
          32.
          Add @Since annotation to ml.param and ml.* Sub-task Resolved Hiroshi Takahashi  
          33.
          Add @Since annotation to ml.recoomendation Sub-task Resolved Tijo Thomas  
          34.
          Add @Since annotation to ml.regression Sub-task Resolved Ehsan Mohyedin Kermani  
          35.
          Add @Since annotation to ml.tuning Sub-task Resolved Yu Ishikawa  
          36.
          Add @Since annotation to ml.util Sub-task Resolved Ehsan Mohyedin Kermani  
          37.
          Add @Since annotation to ml.tree Sub-task Closed Hiroshi Takahashi  
          38.
          Add @since annotation to pyspark.mllib.classification Sub-task Resolved Noel Smith  
          39.
          Add @since annotation to pyspark.mllib.clustering Sub-task Resolved Noel Smith  
          40.
          Add @since annotation to pyspark.mllib.evaluation Sub-task Resolved Noel Smith  
          41.
          Add @since annotation to pyspark.mllib.feature Sub-task Resolved Noel Smith  
          42.
          Add @since annotation to pyspark.mllib.fpm Sub-task Resolved Yu Ishikawa  
          43.
          Add @since annotation to pyspark.mllib.random Sub-task Resolved Yu Ishikawa  
          44.
          Add @since annotation to pyspark.mllib.recommendation Sub-task Resolved Yu Ishikawa  
          45.
          Add @since annotation to pyspark.mllib.regression Sub-task Resolved Yu Ishikawa  
          46.
          Add @since annotation to pyspark.mllib.tree Sub-task Resolved Yu Ishikawa  
          47.
          Add @since annotation to pyspark.mllib.util Sub-task Resolved Yu Ishikawa  
          48.
          Add @since annotation to pyspark.ml.classification Sub-task Resolved Yu Ishikawa  
          49.
          Add @since annotation to pyspark.ml.clustering Sub-task Resolved Yu Ishikawa  
          50.
          Add @since annotation to pyspark.ml.recommendation Sub-task Resolved Yu Ishikawa  
          51.
          Add @since annotation to pyspark.ml.regression Sub-task Resolved Yu Ishikawa  
          52.
          Add @since annotation to pyspark.ml.tuning Sub-task Resolved Yu Ishikawa  
          53.
          Add @since annotation to pyspark.ml.util Sub-task Closed Yu Ishikawa  
          54.
          Add @since annotation to pyspark.ml.param and pyspark.ml.* Sub-task Resolved Li Hao  

          Activity

            People

              mengxr Xiangrui Meng
              mengxr Xiangrui Meng
              Votes:
              0 Vote for this issue
              Watchers:
              8 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - 19.5h
                  19.5h
                  Remaining:
                  Remaining Estimate - 19.5h
                  19.5h
                  Logged:
                  Time Spent - Not Specified
                  Not Specified