Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-2984

Support lenient parsing of SVMLight input files

    XMLWordPrintableJSON

Details

    Description

      The current implementation for the reader assumes that the format follows the exact specification.

      The splice-site Dataset dataset is formatted slightly different

      Example

      -1  1:0.381846 2:0.163648 3:0.245472 4:0.627318
      

      note the two spaces after the label.

      Currently MLUtils.scala splits on single spaces.

      Attachments

        Issue Links

          Activity

            People

              chiwanpark Chiwan Park
              jkirsch Johannes
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: