Mahout / MAHOUT-1641

Add conversion from an RDD[(String, String)] to a Drm[Int]

    Details

    • Type: Question
    • Status: Closed
    • Priority: Major
    • Resolution: Implemented
    • Affects Version/s: 0.9
    • Fix Version/s: 0.10.1, 0.11.0
    • Component/s: spark
    • Labels:

      Description

      Hi.

      We are using the cooccurrence part of Mahout as a library. We get our data from other sources, such as Cassandra. We don't want to write that data to disk and read it back, since we already have the data on each slave.

      I have created some conversion functions based on one of the IndexedDatasetSpark readers; I can't remember which one at the moment.
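
      The conversion functions themselves are not attached to this issue. As a rough sketch of the general idea only (assign each distinct string ID a dense Int index, then group the pairs into Int-keyed sparse rows), the following uses plain Scala collections in place of Spark RDDs. Every name in it (PairsToIndexedRows, buildDictionary, toIndexedRows) is hypothetical and is not part of the Mahout API.

```scala
// Illustrative sketch only: plain Scala collections stand in for Spark RDDs,
// and all names here are hypothetical, not Mahout APIs.
object PairsToIndexedRows {

  /** Assign each distinct ID a dense Int index, in first-seen order. */
  def buildDictionary(ids: Seq[String]): Map[String, Int] =
    ids.distinct.zipWithIndex.toMap

  /** Convert (rowId, columnId) pairs into Int-keyed sparse rows.
    * Each row maps a column index to 1.0, marking that the pair occurred.
    * The two dictionaries are returned so indices can be mapped back to IDs.
    */
  def toIndexedRows(
      pairs: Seq[(String, String)]
  ): (Map[Int, Map[Int, Double]], Map[String, Int], Map[String, Int]) = {
    val rowDict = buildDictionary(pairs.map(_._1))
    val colDict = buildDictionary(pairs.map(_._2))
    val rows = pairs
      .groupBy { case (rowId, _) => rowDict(rowId) }
      .map { case (rowIdx, group) =>
        rowIdx -> group.map { case (_, colId) => colDict(colId) -> 1.0 }.toMap
      }
    (rows, rowDict, colDict)
  }
}
```

      In a Spark setting the same two steps would run over the RDD instead of a Seq, and the dictionaries would need to be shared with the workers; those details are left out of this sketch.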

      Is there interest in the community for this kind of feature? I can probably clean it up and submit it as a GitHub pull request.

            People

            • Assignee:
              pferrel Pat Ferrel
              Reporter:
              hamnis Erlend Hamnaberg
            • Votes:
              0
              Watchers:
              4