[MAHOUT-1568] Build an I/O model that can replace sequence files for import/export - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Implemented
Affects Version/s: None
Fix Version/s: 0.10.0
Component/s: classic
Labels:
- scala
- spark
Environment: Scala, Spark

Description

Implement mechanisms to read and write data from and to flexible stores. These will support tuple streams and DRMs (distributed row matrices), with extensions that preserve user-defined ID values. In some sense the mechanism can replace Sequence Files for import/export, and it will make the operation much easier for users, whose input files can in many cases be consumed directly.

Start with delimited text files for input/output in the Spark version of ItemSimilarity.
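
To make the idea of keeping user-defined IDs concrete, here is a minimal sketch in plain Scala. It is not the proposed implementation and does not use Mahout's or Spark's API: every name in it (`DelimitedImportSketch`, `IdDictionary`, `parse`, `index`, `writeLines`) is hypothetical, ordinary Scala collections stand in for RDDs and DRMs, and the two-column `user,item` input layout is an assumption. It shows delimited lines being parsed into tuples, external string IDs being mapped to dense integer indices, and the indices being translated back to the original IDs on output.

```scala
// Hypothetical sketch: none of these names come from Mahout or Spark.
object DelimitedImportSketch {

  // Bidirectional dictionary between external string IDs and dense Int indices.
  final case class IdDictionary(toIndex: Map[String, Int]) {
    val toId: Map[Int, String] = toIndex.map(_.swap)
  }

  // Assign indices in order of first appearance.
  def buildDictionary(ids: Seq[String]): IdDictionary =
    IdDictionary(ids.distinct.zipWithIndex.toMap)

  // Parse "user<delim>item" lines into (user, item) tuples; malformed lines are skipped.
  def parse(lines: Seq[String], delim: String = ","): Seq[(String, String)] =
    lines.flatMap { line =>
      line.split(java.util.regex.Pattern.quote(delim)).map(_.trim) match {
        case Array(user, item) if user.nonEmpty && item.nonEmpty => Some((user, item))
        case _                                                   => None
      }
    }

  // Build a sparse row-wise structure: row index -> set of column indices.
  def index(
      tuples: Seq[(String, String)],
      rows: IdDictionary,
      cols: IdDictionary): Map[Int, Set[Int]] =
    tuples
      .groupBy { case (user, _) => rows.toIndex(user) }
      .map { case (r, ts) => r -> ts.map { case (_, item) => cols.toIndex(item) }.toSet }

  // Translate indices back to the user's own IDs, one output line per row.
  def writeLines(
      matrix: Map[Int, Set[Int]],
      rows: IdDictionary,
      cols: IdDictionary,
      delim: String = "\t"): Seq[String] =
    matrix.toSeq.sortBy(_._1).map { case (r, cs) =>
      rows.toId(r) + delim + cs.toSeq.sorted.map(cols.toId).mkString(" ")
    }

  def main(args: Array[String]): Unit = {
    val lines  = Seq("u1,iphone", "u2,ipad", "u1,ipad", "bad line")
    val tuples = parse(lines)
    val rows   = buildDictionary(tuples.map(_._1))
    val cols   = buildDictionary(tuples.map(_._2))
    writeLines(index(tuples, rows, cols), rows, cols).foreach(println)
  }
}
```

Keeping the dictionaries alongside the indexed data is what allows results to be written with the caller's original IDs, so the user never needs a separate ID-translation step around the job.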

A proposed implementation is running with ItemSimilarity on Spark and is documented on the GitHub wiki: https://github.com/pferrel/harness/wiki

Comments are appreciated.

People

Assignee: Pat Ferrel

Reporter: Pat Ferrel

Votes: 1

Watchers: 3

Dates

Created: 01/Jun/14 17:25

Updated: 31/Jan/24 22:14

Resolved: 18/Mar/15 13:44