Mahout
  1. Mahout
  2. MAHOUT-873

Provide MapReduce job for creating Encoded Vectors from sequence files

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6
    • Component/s: None

      Description

      Similar to SparseVectorsFromSequenceFiles, provide a version that can do encoded vectors. Start simple by handling basic text, but this could easily evolve to handle pluggable Vectorizer's that can better deal with features (numerics, etc.).

      1. MAHOUT-873.patch
        35 kB
        Grant Ingersoll
      2. MAHOUT-873.patch
        47 kB
        Grant Ingersoll
      3. MAHOUT-873.patch
        51 kB
        Grant Ingersoll

        Issue Links

          Activity

          No work has yet been logged on this issue.

            People

            • Assignee:
              Grant Ingersoll
              Reporter:
              Grant Ingersoll
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development