Uploaded image for project: 'SAMOA'
  1. SAMOA
  2. SAMOA-47

Integrate Avro Streams with SAMOA

    XMLWordPrintableJSON

Details

    • Patch

    Description

      The current SAMOA readers can only support data streams in ARFF format. Hence SAMOA as a distributed streaming machine learning framework is limited in scope since end users may have to transform their data to ARFF . Apache Avro is a data serialization system that handles data streams in compact binary format and is typically used in conjunction with with Big Data eco-system tools. Avro allows two encodings for the data: Binary & JSON. Hence an Avro support may allow users with JSON data also to use SAMOA seamlessly.

      The GOAL is to build support for Avro Streams into SAMOA by adding Avro File Stream Handler, Avro Loader to read records & transform to instances and a user option to switch between JSON/Binary encodings. The input format with representation of meta-data for both JSON/Binary data to be finalized along with build.

      Attachments

        Activity

          People

            Unassigned Unassigned
            jayadeepj jayadeepj
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: