Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-9664

FlinkML Quickstart Loading Data section example doesn't work as described

Agile BoardRank to TopRank to BottomAttach filesAttach ScreenshotVotersStop watchingWatchersCreate sub-taskConvert to sub-taskLinkCloneLabelsUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      The ML documentation example isn't complete: https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/libs/ml/quickstart.html#loading-data

      The referred section loads data from an astroparticle binary classification dataset to showcase SVM. The dataset uses 0 and 1 as labels, which doesn't produce correct results. The SVM predictor expects -1 and 1 labels to correctly predict the label. The documentation, however, doesn't mention that. The example therefore doesn't work without a clue why.

      The documentation should be updated with an explicit mention to -1 and 1 labels and a mapping function that shows the conversion of the labels.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            rongr Rong Rong
            manoswerts Mano Swerts
            Votes:
            0 Vote for this issue
            Watchers:
            2 Stop watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

              Estimated:
              Original Estimate - 1h
              1h
              Remaining:
              Remaining Estimate - 1h
              1h
              Logged:
              Time Spent - Not Specified
              Not Specified

              Issue deployment