Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-873

Provide MapReduce job for creating Encoded Vectors from sequence files

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6
    • Component/s: None

      Description

      Similar to SparseVectorsFromSequenceFiles, provide a version that can do encoded vectors. Start simple by handling basic text, but this could easily evolve to handle pluggable Vectorizer's that can better deal with features (numerics, etc.).

        Attachments

        1. MAHOUT-873.patch
          51 kB
          Grant Ingersoll
        2. MAHOUT-873.patch
          47 kB
          Grant Ingersoll
        3. MAHOUT-873.patch
          35 kB
          Grant Ingersoll

          Issue Links

            Activity

              People

              • Assignee:
                gsingers Grant Ingersoll
                Reporter:
                gsingers Grant Ingersoll
              • Votes:
                0 Vote for this issue
                Watchers:
                0 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: